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In this paper we consider three different kinds of domain-dependent control knowledge (temporal, 
procedural and HTN-based) that are useful in planning. Our approach is declarative and relies 
on the language of logic programming with answer set semantics (AnsProlog*). AnsProlog* is 
designed to plan without control knowledge. We show how temporal, procedural and HTN- 
based control knowledge can be incorporated into AnsProlog* by the modular addition of a small 
number of domain-dependent rules, without the need to modify the planner. We formally prove 
the correctness of our planner, both in the absence and presence of the control knowledge. Finally, 
we perform some initial experimentation that demonstrates the potential reduction in planning 
time that can be achieved when procedural domain knowledge is used to solve planning problems 
with large plan length. 

Categories and Subject Descriptors: 1.2.8 [Artificial Intelligence]: Problem Solving, Control 
Methods, and Search — Plan generation; 1.2.3 [Artificial Intelligence]: Deduction and Theo- 
rem Proving — Logic programming; 1.2 A [Artificial Intelligence]: Knowledge Representation 
Formalisms and Methods — Representation languages 

General Terms: Planning, Control Knowledge, Answer Set Planning 
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1. INTRODUCTION AND MOTIVATION 

The simplest formulation of planning - referred to as classical planning ~ entails 
finding a sequence of actions that takes a world from a completely known initial 
state to a state that satisfies certain goal conditions. The inputs to a corresponding 
planner are the descriptions (in a compact description language such as STRIPS 
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[Fikes and Nilson 1971]) of the effects of actions on the world, the description of the 
initial state and the description of the goal conditions, and the output is a plan (if 
it exists) consisting of a sequence of actions. The complexity of classical planning 
is known to be undecidable in the general case [Chapman 1987; Erol et al. 1995]. It 
reduces to PSPACE-complete for finite and deterministic domains [Bylander 1994]. 
By making certain assumptions such as fixing the length of plans, and requiring 
actions to be deterministic the complexity reduces to NP-completc. 

The ability to plan is widely recognized to be an important characteristic of an 
intelligent entity. Thus, when developing intelligent systems, we often need to in- 
corporate planning capabilities, despite their inherent complexity. Since the com- 
plexity is due to the exponential size of the search space, planning approaches that 
overcome this complexity require efficient and intelligent search. This is at the 
crux of tliKX' common and successful approaches to planning: (i) using heuristics 
[Bonet and Geffner 2001; Hoffmann and Nebel 2001; Blum and Furst 1997] that 
are derived from the problem description, (ii) translating the planning problem 
into a model finding problem in a suitable logic and using model finding techniques 
for that logic [Kautz and Selman 1998a], and (iii) using domain-dependent control 
knowledge^ [Bacchus and Kabanza 2000; Doherty and Kvarnstom 1999; Nau et al. 
1999]. The use of domain-dependent control knowledge has led to several successful 
planners, including TLPlan [Bacchus and Kabanza 2000], TALplan [Doherty and 
Kvarnstom 1999] and SHOP [Nau et al. 1999], which have performed very well 
on planning benchmarks. Strictly speaking, planners that use control knowledge 
are no longer considered to be classical planners since they require the addition of 
domain-dependent control knowledge to the problem specification. Nevertheless, 
such planners are predicted to be the most scalable types of planning systems in 
the long term [Wilkins and desJardines 2001]. In this paper we integrate the sec- 
ond and the third approaches identified above by translating a planning problem 
with domain-dependent control knowledge into a problem of model finding in logic 
programming. 

We integrate domain-dependent control knowledge into our planner in such a way 
that planning can still be performed without this extra control knowledge. The 
control knowledge may simply improve the speed with which a plan is generated 
or may result in the generation of plans with particular desirable characteristics. 
In this respect^ our approach is similar in spirit to the planning systems TLPlan 
[Bacchus and Kabanza 2000], and TALplan [Doherty and Kvarnstom 1999], but 
differs from typical Hierarchical Task Network (HTN) planners (e.g., SHOP [Nau 
et al. 1999]) because HTN planners require integration of domain-dependent control 



'^This is alternatively referred to in the literature as 'domain-dependent knowledge', 'control knowl- 
edge', 'domain knowledge', and 'domain constraints'. We also sometimes use these shortened terms 
in this paper. 

^We differ from [Bacchus and Kabanza 2000] in another aspect. Unlike in [Bacchus and Kabanza 
2000] where the domain knowledge is used by the search program thus controlling the search, our 
domain knowledge is encoded as a logic program which is directly added to the logic program 
encoding planning. In such an approach there is no guarantee that the added rules will reduce 
the search during answer set computation; although our experimentation shows that it does for 
large plan lengths. The paper [Huang et al. 1999] also comments on this aspect. 
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knowledge into the specification of the planning problem. As such, HTN planning 
cannot be performed without the existence of this knowledge, unlike our planner. 

In this paper, we explore three kinds of domain control knowledge: temporal knowl- 
edge, procedural knowledge, and HTN-bascd knowledge. Our treatment of tempo- 
ral knowledge is similar to that used in both the TLPlan and TALplan systems. Our 
formulation of procedural knowledge is inspired by GOLOG, referred to alterna- 
tively as a logic programming language, or an action execution language [Lcvcsquc 
et al. 1997]. Although our procedural knowledge is similar to the syntax of a 
GOLOG program, how this knowledge is used in planning is quite different. Simi- 
larly, our formulation of HTN-bascd knowledge is inspired by the partial-ordering 
constructs used in HTN planners, but our use of this type of knowledge during 
planning is very different from the workings of HTN planners. The main difference 
is that both GOLOG programming and HTN planning rely on the existence of 
domain-dependent control knowledge within the problem specification and cannot 
perform classical planning in the absence of this knowledge. In contrast, our ap- 
proach, which is similar to the approach in [Bacchus and Kabanza 2000], separates 
the planner module from the domain knowledge (encoding temporal, procedural, 
or HTN-based knowledge), and can plan independent of the domain knowledge. 

To achieve our goal of planning using domain-dependent control knowledge, an 

important first step is to be able to both reason about actions and their effects on the 
world, and represent and reason about domain-dependent control knowledge. This 
leads to the question of choosing an appropriate language for both reasoning and 
representation tasks. For this we choose the action language B from [Gelfond and 
Lifschitz 1998] and the language of logic programming with answer set semantics 
(AnsProlog*) [Baral 2003], also referred to as A-Prolog [Gelfond and Leone 2002]. 
We discuss our choice on B in Section 2. We selected AnsProlog* over other action 
languages for a number of important reasons, many of which are listed below. These 
points are elaborated upon in [Baral 2003]. 

— AnsProlog* is a non-monotonic language that is suitable for knowledge repre- 
sentation. It is especially well-suited to reasoning in the presence of incomplete 
knowledge. 

— The non-classical constructs give a structure to AnsProlog* programs and state- 
ments, such as a head and a body, which allows us to define various subclasses of 
the language, each with different complexity and expressivity properties [Dantsin 
et al. 2001] . The subclass of AnsProlog* programs in which no classical negation 
is allowed has the same complexity as prepositional logic, but with added ex- 
pressivity. The most general class of AnsProlog* programs, which allows "or" in 
the head, has the complexity and expressivity of the seemingly more complicated 
default logic [Reiter 1980]. In general, AnsProlog* is syntactically simpler than 
other non-monotonic logics while equally as expressive as many. 

— There exists a sizable body of "building block" results about AnsProlog* which 
we may leverage both in knowledge representation tasks and in the analysis of 
the correctness of the representations. This includes result about composition of 
several AnsProlog* programs so that certain original conclusions are preserved 
(referred to as 'restricted monotonicity'), a transformation of a program so that it 
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can deal with incomplete information, abductive assimilation of new knowledge, 
language independence and tolerance, splitting an AnsProlog* program to smaller 
components for computing its answer sets, and proving properties about the 
original program. 

— There exist several efficient AnsProlog* interpreters [Eiter et al. 1998; Simons 

et al. 2002] and AnsProlog* has been shown to be useful in several application 
domains other than knowledge representation and planning. This includes policy 
description, product configuration, cryptography and encryption, wire routing, 
decision support in a space shuttle and its 'if 'then' structure has been found to 
be intuitive for knowledge encoding from a human expert point of view. 
— Finally, AnsProlog* has already been used in planning [Subrahmanian and Zan- 
iolo 1995; Dimopoulos et al. 1997; Lifschitz 1999b], albeit in the absence of 
domain-dependent control knowledge. In this regard AnsProlog* is suitable for 
concisely expressing the effects of actions and static causal relations between flu- 
ents. Note that concise expression of the effects of actions requires addressing 
the 'frame problem' which was one of the original motivation behind the devel- 
opment of non-monotonic logics. Together with its ability to enumerate possible 
action occurrences AnsProlog* is a suitable candidate for model-based planning, 
and falls under category (ii) (above) of successful approaches to planning. 

As evident from our choice of language, our main focus in this paper is the knowl- 
edge representation aspects of planning using domain-dependent control knowledge. 
In particular, our concerns includes: 

— the ease of expressing effects of actions on the world, and reasoning about them, 
— the ease of expressing and reasoning about various kinds of domain constraints, 

— the ease of adding new kinds of domain constraints, and 

— correctness results for the task of planning using an AnsProlog* representation 
that includes domain constraints. 

We also perform a limited number efficiency experiments, but leave more detailed 
experimentation to future work. 

With the above focus, the contributions of the paper and the outline of the paper 

is as follows: 

(1) In Section 3 we encode planning (without domain constraints) using AnsPro- 
log* in the presence of both dynamic effects of actions and static causal laws, 
and with goals expressed as a conjunction of fluent literals. Wc then formally 
prove the relationship between valid trajectories of the action theory and an- 
swer sets of the encoded program. Our approach is similar to [Lifschitz and 
Turner 1999; Eiter et al. 2000] but differs from [Subrahmanian and Zaniolo 
1995; Dimopoulos et al. 1997; Lifschitz 1999b]. The main difference between 
our formulation and earlier AnsProlog* encodings in [Subrahmanian and Zan- 
iolo 1995; Dimopoulos et al. 1997; Lifschitz 1999b] is in our use of static causal 
laws, and our consideration of trajectories instead of plans. Our trajectories 
are similar to histories in [Lifschitz and Turner 1999] and to optimistic plans 
in [Eiter et al. 2000]. The reason we relate answer sets to trajectories rather 
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than relating them to plans is because in the presence of static causal laws the 
effects of actions may be non-deterministic. 

(2) In Section 4.1 wc show how to incorporate temporal constraints for planning 
into our formulation of the planning problem. Incorporating temporal con- 
straints simply requires the addition of a few more rules, illustrating the declar- 
ative nature and elaboration tolerance of our approac;h. Wc define formulas for 
representing temporal constraints and specify when a trajectory satisfies a tem- 
poral constraint. We then formally prove the relationship between valid tra- 
jectories of the action theory satisfying the temporal constraints, and answer 
sets of the updated program. Our approach differs from [Bacchus and Ka- 
banza 2000; Doherty and Kvarnstom 1999] in that we use AnsProlog* for both 
the basic encoding of planning and the temporal constraints, while the plan- 
ners in [Bacchus and Kabanza 2000; Doherty and Kvarnstom 1999] are written 
in procedural languages. Preliminary experiments show that our approach is 
less efficient than TLPlan and TALPlan. Nevertheless, both these systems are 
highly optimized, so the poorer performance may simply reflect the lack of op- 
timizations in our implementation. On the other hand, our use of AnsProlog* 
facilitates the provision of correctness proofs, which is one of our major con- 
cerns. Neither of [Bacchus and Kabanza 2000; Doherty and Kvarnstom 1999] 
provide correctness proofs of their planners. 

(3) In Section 4.2 we consider the use of procedural domain knowledge in planning. 
An example of a procedural domain knowledge is a program written as ai ; a2 ; 
(as I a.4 I as) ;f?. This program tells the planner that it should make a plan 
where ai is the first action, a2 is the second action and then it should choose 
one of as, 04 or 05 such that after the plan's execution / will be true. 

We define programs representing procedural domain knowledge and specify 
when a trajectory is a trace of such a program. We then show how to incorpo- 
rate the use of procedural domain knowledge in planning to the initial planning 
formulation described in item (1.). As in (2.) the incorporation involves only 
the addition of a few more rules. We then formally prove the relation between 
valid trajectories of the action theory satisfying the procedural domain knowl- 
edge, and answer sets of the updated program. We also present experimental 
results (Section 4.4) showing the improvement in planning time due to using 
such knowledge over planning in the absence of such knowledge. 

(4) In Section 4.3 we motivate the need for additional constructs from HTN plan- 
ning to express domain knowledge and integrate features of HTNs with proce- 
dural constructs to develop a more general language for domain knowledge. We 
then define trace of such general programs and show how to incorporate them 
in planning. Wc then formally prove the relation between valid trajectories of 
the action theory satisfying the general programs containing both procedural 
and HTN constructs, and answer sets of the updated program. To the best 
of our knowledge this is the first time an integration of HTN and procedural 
constructs has been proposed for use in planning. 

(5) As noted above, a significant contribution of our work is the suite of correctness 
proofs for our AnsProlog* formulations. All the proofs appear in Appendix A. 

ACM Transactions on Computational Logic, Vol. V, No. N, February 2008. 



6 • Tran Cao Son, Chitta Baral, Nam Tran, and Sheila Mcllraith 

For completeness Appendix B presents results concerning AnsProlog* that we 
use in the Appendix A proofs. 

In regards to closely related work, although satisfiability planning (see e.g., [Kautz 

and Selman 1992; Kautz et al. 1994; Kautz and Selman 1996; 1998a]) has been 
studied quite a bit, those papers do not have correctness proofs and do not use the 
varied domain constraints that we use in this paper. 

We now start with some preliminaries and background material about reasoning 
about actions and AnsProlog*, which will be used in the rest of the paper. 

2. PRELIMINARIES AND BACKGROUND 

In this section, we review the basics of the action description language B, the answer 
set semantics of logic programs (AnsProlog) , and key features of problem solving 
using AnsProlog. 

2.1 Reasoning about actions: the action description language B 

Recall that planning involves finding a sequence of actions that takes a world from 

a given initial state to a state that satisfies certain goal conditions. To do planning, 
we must first be able to reason about the impact of a single action on a world. 
This is also the first step in 'reasoning about actions'. In general, reasoning about 
action involves defining a transition function from states (of the world) and actions 
to sets of states where the world might be after executing the action. Since explicit 
representation of this function would require exponential space in the size of the 
number of fluents (i.e., properties of the world), actions and their effects on the 
world are described using an action description language, and the above mentioned 
transition function is implicitly defined in terms of that description. In this paper, 
we adopt the language B [Gelfond and Lifschitz 1998], which is a subset of the 
language proposed in [Turner 1997], for its simple syntax and its capability to rep- 
resent relationships between fluents, an important feature lacking in many variants 
of the action description language A [Gelfond and Lifschitz 1993]. We note that 
the main results of this paper can be used in answer set planning systems which 
use other languages for representing and reasoning about the effects of actions. 

We now present the basics of the action description language B. An action theory 
in B is defined over two disjoint sets, a set of actions A and a set of fluents F, which 
are defined over a signature c = (O, AN, FN) where 

— O is a finite set of object constants; 

— AN is a finite set of action names, each action name is associated with a number 

n, n > 0, which denotes its arity; and 
— FN is a finite set of fluent names, each fluent name is associated with a number 

n, n > 0, which denotes its arity. 

An action in A is of the form a(ci, . . . , c„) where a G AN is a n-ary action name 
and Ci is a constant in O. A fluent in F is of the form /(ci, . . . , c^) where / e FN 
is an m-ary fluent name and Cj is a constant in O. For simplicity, we often write a 
and / to represent an action or a fluent whenever it is unambiguous. Furthermore, 
we will omit the speciflcation of a when it is clear from the context. 
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A fluent literal is either a fluent / G F or its negation A fluent formula is a 
prepositional formula constructed from fluent literals. An action theory is a set of 
propositions of the following form: 



where / and pi's are fluent literals and a is an action. (1) represents a static causal 
law, i.e., a relationship between fluents^. It conveys that whenever the fluent literals 
Pi, . . . ,Pn hold in a state, that causes / to also hold in that state. (2), referred to 
as a dynamic causal law, represents the (conditional) effect of a while (3) encodes 
an executability condition of a. Intuitively, an executability condition of the form 
(3) states that a can only be executed if p^'s holds. A dynamic law of the form 
(2) states that / is caused to be true after the execution of a in any state of the 
world where pi,. . . ,Pn are true. Propositions of the form (4) are used to describe 
the initial state. They state that / holds in the initial state. 

An action theory is a pair (D, T) where T, called the initial state, consists of propo- 
sitions of the form (4) and D, called the domain description, consists of propositions 
of the form (l)-(3). For convenience, we sometimes denote the set of propositions 
of the form (1), (2), and (3) by Dc, Dn, and De, respectively. 

Example 2.1. Let us consider a modifled version of the suitcase s with two 
latches from [Lin 1995]. We have a suitcase with two latches h and l2- h is up 
and I2 is down. To open a latch (Zi or I2) we need a corresponding key (fci or 
k2, respectively). When the two latches are in the up position, the suitcase is 
unlocked. When one of the latches is down, the suitcase is locked. The signature 
of this domain consists of 

— O = {Ii,l2,s,ki,k2}; 

— AN = {open, close}, both action names are associated with the number 1, and 
— FN = {up, locked, holding}, all fluent names are associated with the number 1. 

In this domain, we have that 



We now present the propositions describing the domain. 



constraint between fluents can also be represented using static causal laws. For example, to 
represent the fact that a door cannot be opened and closed at the same time, i.e. the fluents 

opened and closed cannot be true at the same time, we introduce a new fluent, say inconsistent, 
and represent the constraint by two static causal laws caused[{opened, closed} , inconsistent) and 
caused({opened, closed}, ^inconsistent). 



caused({pi,...,p„},/) 
causes(a,/, {pi,...,p„}) 
executable(a, {pi, . . . ,Pn}) 
initially(/) 



(1) 
(2) 
(3) 
(4) 



A = {open{li), open{l2), close{li), close{l2)} 



and 



F — {locked{s) , up{l2), up{li), holding{ki), holding{k2)}. 
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Opening a latch puts it into the up position. This is represented by the dynamic 
laws: 

causes(open(Zi), up{li), {}) and causes(open(Z2), upih), {})• 
Closing a latch puts it into the down position. This can be written as: 

causes(cZose(Zi), -<up{li), {}) and causes(dose(Z2), ~^up{h)-, {})• 
We can open the latch only when we have the key. This is expressed by: 
executable(open(Zi), {holdmg{ki)}) and executable(open(Z2), {holding{k2)})- 

No condition is required for closing a latch. This is expressed by the two proposi- 
tions: 

executable(dose(li), {}) and executable(dose(/2), {})• 

The fact that the suitcase will be unlocked when the two latches are in the up 
position is represented by the static causal law: 

caused({up(Zi), up{l2)}, -^locked{s)). 

Finally, to represent the fact that the suitcase will be locked when either of the two 
latches is in the down position, we use the following static laws: 

caused({-iMp(/i)}, Zocfcerf(s)) and caused({-iup(Z2)}, /ocfcecJ(s)) 

The initial state of this domain is given by 

' initially(wp(^i)) 

initially(-iup(Z2)) 

r = < initially(Zocfce(i(s)) 

initially (-i/ioWm(7 (/ci ) ) 
^ initially(/ioZrfmfif(fc2)) 

□ 

A domain description given in B defines a transition function from pairs of actions 
and states to sets of states whose precise definition is given below. Intuitively, given 
an action a and a state s, the transition function $ defines the set of states $(a, s) 
that may be reached after executing the action o in state s. If $(a, s) is an empty 
set it means that the execution of a in s results in an error. We now formally define 

Let be a domain description in B. A set of fluent literals is said to be consistent 
if it does not contain / and -1/ for some fluent /. An interpretation I of the fluents 
in D is a maximal consistent set of fluent literals of D. A fluent / is said to be 
true (resp. false) in / iff / G / (resp. -1/ S /). The truth value of a fluent formula 
in I is defined recursively over the propositional connectives in the usual way. For 
example, / A g is true in / iff / is true in / and g is true in /. We say that a formula 
if holds in / (or / satisfies <^), denoted by / |= lyj, if 1^ is true in /. 

Let M be a consistent set of fiuent literals and K a set of static causal laws. We say 
that u is closed under K if for every static causal law 

caused({pi,...,p„},/) 
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in K, if u \= piA. . .Apn then u\= f. By CIk{u) we denote the least consistent set 
of literals from D that contains u and is also closed under K. It is worth noting 

that CIk{u) might be undefined. For instance, if u contains both / and ^/ for 
some fluent /, then CIk{u) cannot contain u and be consistent; another example 
is that if « = {/, g} and K contains 

caused({/}, h) 

and 

caused({/, (?},-. /i), 

then CIk{u) does not exist because it has to contain both h and -i/i, which means 
that it is inconsistent. 

Formally, a state of D is an interpretation of the fluents in F that is closed under 

the set of static causal laws Dc of D. 

An action a is executable in a state s if there exists an cxecutability proposition 

executable(a, {/i, . . . , /„}) 
in D such that s |= /i A . . . A /„. Clearly, if 

executable(a, {}) 
belongs to D, then a is executable in every state of D. 
The direct effect of an action a in a state s is the set 

E{a, s) = {/ I causes(a, /, {/i, . . . , /„}) G D,s \= fi A . . . A /„}. 

For a domain description D, $(a, s), the set of states that may be reached by 
executing a in s, is defined as follows. 

(1) If a is executable in s, then 

$(a, s) = {s' I s' is a state and s' = ClDc{E{a, s) U (s n s'))}; 

(2) If a is not executable in s, then $(a, s) = 0. 

The intuition behind the above formulation is as follows. The direct effects of an 
action a in a state s are determined by the dynamic causal laws and arc given by 
E(a, s). All fluent literals in E{a, s) must hold in any resulting state. The set sHs' 
contains the fluent literals of s which continue to hold by inertia, i.c they hold in s' 
because they were not changed by an action. In addition, the resulting state must 
be closed under the set of static causal laws Dq- These three aspects are captured 
by the definition above. Observe that when Dc is empty and a is executable in 
state s, <I>(a, s) is equivalent to the set of states that satisfy E{a., ,s) and are closest 
to s using symmetric difference^ as the measure of closeness [McCain and Turner 
1995]. Additional explanations and motivations behind the above definition can be 
found in [Baral 1995; McCain and Turner 1995; Turner 1997]. 



*We say si is strictly closer to s than S2 if si \ s U s \ si C S2 \ s U s \ S2. 
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Every domain description D in B has a unique transition function and we say 
$ is the transition function of D. We iUustrate the definition of the transition 
function in the next example. 

Example 2.2. For the suitcase domain in Example 2.1, the initial state given 
by the set of propositions 

initially(Mp(ii)) 

initially(->Mp(Z2)) 

r = < initially(?ocfce(i(.s)) 

initially (-i/ioWmg (/ci ) ) 
^ initia.lly{holding{k2)) 

is 

So = {up{li), -^up{l2), locked{s), -iholding{ki), holding{k2)}- 

In state sq, the three actions open{l2), close{li), and dose{l2) are executable. 

open{l2) is executable since holding{k2) is true in sq while close{li) and close(l2) 
are executable since the theory (implicitly) contains the propositions: 

executable(c/ose(/i), {}) and executable(dose(Z2), {}) 

which indicate that these two actions are always executable. The following transi- 
tions are possible from state sq- 

{ up{li), up{l2), -^locked{s), -^holding{ki), holding{k2) } G $(open(Z2), so)- 
{up{li),->up{l2),locked{s),-'holding{ki),holding{k2) } G $(dose(Z2), sq). 
{ -iup(Zi), -^up{l2), locked{s), -<holding{ki), holding{k2) } G $(dose(Zi), sq). 



□ 

For a domain description D with transition function a sequence sooosi . . . a„_iSrj 
where Sj's are states and aj's are actions is called a trajectory in D if Sj+i € 
$(si, ttj+i) for i e {0, . . . , n — 1}. A trajectory soaosi . . . a„_is„ achieves a fluent 
formula A if s„ |= A. 

A domain description D is consistent iff for every action a and state ,s, if a is 
executable in s, then $(a, s) ^ 0. An action theory {D,T) is consistent if D is 
consistent and so = {/ | initially(/) € F} is a state of D. In what follows, we will 
consider only^ consistent action theories. 

A planning problem with respect to B is specified by a triple (D, F, A) where {D. F) 
is an action theory in B and A is a fluent formula (or goat), which a goal state must 
satisfy. A sequence of actions ao, . . . , am-i is then called a possible plan for A if 
there exists a trajectory soaosi ...a„,_is„i in D such that .so and .s„, satisfies F 
and A, respectively. Note that we define a 'possible plan' instead of a 'plan'. This 
is because the presence of static causal laws in D allows the possibility that the 

^We thank one of the anonymous referee for pointing out that without this assumption, finding a 
plan would be SsP-complete even with respect to a complete initial state. 
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effects of actions may be non-deterministic, and planning with non-deterministic 
actions has the complexity of S2P-complete [Turner 2002] and hence is beyond the 

expressiveness of AnsProlog. However, if D is deterministic, i.e.. |$(a.s)| < 1 for 
every pair (a, s) of actions and states, then the notions of 'possible plan' and 'plan' 
coincide. 

2.2 Logic Programming with answer set semantics (AnsProlog) and its application 

In this section we review AnsProlog (a sub-class of AnsProlog*) and its applicability 
to problem solving. 

2.2.1 AnsProlog. Although the programming language Prolog and the field of 
logic programming have been around for several decades, the answer set semantics 
of logic programs - initially referred to as the stable model semantics, was only pro- 
posed by Gelfond and Lifschitz in 1988 [Gelfond and Lifschitz 1988]. Unlike earlier 
characterizations of logic programs where the goal was to find a unique appropriate 
'model' of a logic program, the answer set semantics allows the possibility that a 
logic program may have multiple appropriate models, or no appropriate models at 
all. Initially, some considered the existence of multiple or no stable models to be 
a drawback of stable model semantics, while others considered it to be a reflection 
of the poor quality of the program in question. Nevertheless, it is this feature of 
the answer set semantics [Marek and Truszczyhski 1999; Niemela 1999; Lifschitz 
1999b] that is key to the use of AnsProlog for problem solving. We now present 
the syntax and semantics of AnsProlog, which we will simply refer to as a logic 
program. 

A logic program 11 is a set of rules of the form 

ao ^ ai, . . .,am, not a^+i, . . . ,not a„ (5) 

or 

± ^ ai, . . . , a™, not a„i+i, . . . , not a„ (6) 

where < m < n, each is an atom of a first-order language CP, _L is a special 
symbol denoting the truth value false, and not is a connective called negation-as- 
failure. A negation as failure literal (or naf-literal) is of the form not a where a is 
an atom. For a rule of the form (5)- (6), the left and right hand side of the rule are 
called the head and the body, respectively. A rule of the form (6) is also called a 
constraint. 

Given a logic program 11, we will assume that each rule in 11 is replaced by the 
set of its ground instances so that all atoms in 11 are ground. Consider a set of 
ground atoms X. The body of a rule of the form (5) or (6) is satisfied by X if 
{am+i, ■ ■ ■ , dn} n X = and {ai, . . . , am} C X. A rule of the form (5) is satisfied 
by X if either its body is not satisfied hj X or ao € X. A rule of the form (6) is 
satisfied by X if its body is not satisfied by X. An atom a is supported by X if a 
is the head of some rule of the form (5) whose body is satisfied by X. 

For a set of ground atoms S and a program 11, the reduct of 11 with respect to S, 
denoted by 11^, is the program obtained from the set of all ground instances of 11 
by deleting 
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(1) each rule that has a naf-hteral not a in its body with a € S, and 

(2) all naf-literals in the bodies of the remaining clauses. 

S is an answer set (or a stable model) of 11 if it satisfies the following conditions. 

(1) If n does not contain any naf-literal (i.e. m = n in every rule of 11) then S is 
the smallest set of atoms that satisfies all the rules in 11. 

(2) If the program IT does contain some naf-literal (to < n in some rule of 11), then 

S is an answcir set of 11 if 5* is the answcir set of 11''' . (Note that 11'^ does not 
contain naf-literals, its answer set is defined in the first item.) 

A program 11 is said to be consistent if it has an answer set. Otherwise, it is 
inconsistent. 

Many robust and efficient systems that can compute answer sets of propositional 
logic programs have been developed. Two of the frequently used systems are dlv 
[Eitcr ct al. 1998] and smodels [Simons et al. 2002]. Recently, two new systems 
cmodels [Babovich and Lifschitz ] and ASS AT [Lin and Zhao 2002], which com- 
pute answer sets by using SAT solvers, have been developed. XSB [Sagonas et al. 
1994], a system developed for computing the well-founded model of logic programs, 
has been extended to compute stable models of logic programs as well. 

2.2.2 Answer set programming: problem solving using AnsProlog. Prolog and 
other early logic programming systems were geared towards answering yes/no 
queries with respect to a program, and if the queries had variables they returned 
instantiations of the variables together with a 'yes' answer. The possibility of mul- 
tiple answer sets and no answer sets has given rise to an alternative way to solve 
problems using AnsProlog. In this approach, referred to as answer set programming 
(also known as stable model programming) [Marek and Truszczyhski 1999; Niemela 
1999; Lifschitz 1999b], possible solutions of a problem are enumerated as answer 
set candidates and non-solutions are eliminated through rules with _L in the head, 
resulting in a program whose answer sets have one-to-one correspondence with the 
solutions of the problem. 

We illustrate the concepts of answer set programming by showing how the 3-coloring 
problem of a bi-directed graph G can be solved using AnsProlog. Let the three 
colors be red (r), blue (6), and green [g) and the vertex of G be 0, 1, . . . ,n. Let 
P{G) be the program consisting of 

— the set of atoms edge{u, v) for every edge (m, v) of G, 

— for each vertex u of G, three rules stating that u must be assigned one of the 
colors red, blue, or green: 

color {u, g) <— not color {u, b), not color {u, r) 
color{u, r) •*— not color{u, b), not color{u, g) 
color{u, b) ■*— not color{u, r), not color{u, g) 

and 
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— for each edge (it, v) of G, three rules representing the constraint that u and v 
must have different color: 

_L <— color {u, r), color{v, r), edge{u, v) 
_L <— color{u, b), color{v, b), edge{u, v) 
_L <— color{u, g), color {v, g), edge{u, v) 

It can be shown that for each graph G, (i) P(G) is inconsistent iff the 3-coloring 
problem of G does not have a solution; and (ii) if P{G) is consistent then each 
answer set of P{G) corresponds to a solution of the 3-coloring problem of G and 
vice versa. 

To make answer set style programming easier, Niemcla ct al. [Niemela et al. 1999] 
introduce a new type of rules, called cardinality constraint rule (a special form of 
the weight constraint rule) of the following form: 

l{bi, bk}u •*— ai, . . . , am, not a^+i, . . . , not an (7) 

where a^ and bj are atoms and I and u are two integers, I < u. The intuitive 
meaning of this rule is that whenever its body is satisfied then at least I and at 
most u atoms of the set {5i, . . . , bk} must be true. Using rules of this type, one 
can greatly reduce the number of rules of programs in answer set programming. 
For instance, in the above example, the three rules representing the constraint that 
every node u needs to be assigned one of the three colors can be packed into one 
cardinality constraint rule: 

l{color{u, g), color{u, r), color{u, b)}l <— 

The semantics of logic programs with such rules is given in [Niemela et al. 1999] 
where a program with weight constraint rules is translated into a normal logic 
program whose answer sets define the answer sets of the original program. For our 
purpose in this paper wc only need to consider rules with / < 1, u = 1, and restrict 
that if we have rules of the form (7) in our program then there are no other rules 
with any oi bi, . . . ,bk in their head. 

3. ANSWER SET PLANNING: USING ANSPROLOG FOR PLANNING 

The idea of using logic programming with answer set semantics for planning was 
first introduced in [Subrahmanian and Zaniolo 1995]. It has become more feasible 
since the development of fast and efficient answer set solvers such as smodels 
[Simons ct al. 2002] and dlv [Eiter et al. 1998]. The term "answer set planning^^ 
was coined by Lifschitz in [Lifschitz 1999b] referring to approaches to planning 
using logic programming with answer set semantics, where the planning problem 
is expressed as a logic program and the answer sets encode plans. In that paper 
answer set planning is illustrated with respect to some specific examples. 

We now present the main ideas of answer set planning^ when the effects of actions 
on the world and the relationships between fluents in the world are expressed in 

^Notc that while [Lifschitz 1999b; 1999a; 2002] illustrated answer set planning through specific 
examples, the papers [Lifschitz and Turner 1999; Lifschitz 1999a] mapped reasoning (not planning) 
in the action description language C to logic programming. 
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the action description language B. Given a planning problem (£>,r,A), answer 
set planning solves it by translating it into a logic program Il{D, T, A) (or 11, for 
short) consisting of dom,ain- dependent rules that describe D, F, and A and dom,ain- 
independent rules that generate action occurrences and represent the inertial law. 
We assume that actions and fluents in A and F are specified by facts of the form 
action (.) and fluent{.), respectively. The rules of 11 are adapted mainly from 
[Dimopoulos et al. 1997; Lifschitz 1999b; Lifschitz and Turner 1999; Lifschitz 1999a] 
and based on conversations with Michael Gelfond in 1998-99. As customary in the 
encoding of planning problems, we assume that the length of plans we are looking 
for is given. We denote the length by the constant length and use a sort time, 
whose domain is the set of integers from to length, to represent the time moment 
the system is in. We begin with the set of domain-dependent rules. 

3.1 Domain-dependent rules 

For an action theory {D,r), the encoding of {D,T) uses the following predicates: 

(1) holds (F, T) - the fluent literal F holds at the time moment T; 

(2) occ{A, T) - the action A occurs at the time moment T; and 

(3) possible{A, T) - the action A is executable at the time moment T. 

The translation is as follows''. 

(1) For each proposition 

initially(?) 

in F, the fact 

holds{l,0) (8) 

belongs to H. This says that at the time moment 0, the fluent literal I holds. 

(2) For each executability condition 

executable (a, {pi, . ■ . ,Pn}) 
in D, n contains the rule: 

possible{a, T) <— time{T), holds{p\, T),. . ., holds{pn, T). (9) 

This rules states that it is possible to execute the action a at the time moment 

T if {pi, . . . ,pn} holds at T. 

(3) For each dynamic causal law 

causes (a,/, {pi,...,p„}) 

in D, n contains the rule: 

holds{f,T+l) ^ time{T),occ{a,T), , , 

possible{a,T),holds{pi,T), . . . ,holds{pn,T). ^ ' 



Sicstus-program that translates B planning problems into their corresponding smodels en- 
codings is available at http : //www . cs .nmsu. edu/'tson/ASPlan/Knowledge. (An earlier version of 
this translator was posted to the TAG discussion web site http://www.cs.utexas.edu/users/vl/ 
tag/discussions. html). 
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This rule says that if a occurs at the time moment T and a is executable at T 
then the fluent literal / becomes true at T + 1 if there exists a dynamic law 

causes(a,/, {pi,...,p„}) 

in D and the pj's hold at T. 
(4) For each static causal law 

caused({pi,...,p„},/) 
in D, n contains the following rule: 

holds{f, T) ^ time{T), holds{pi, T), . . . , holds{pn, T). (11) 

This rule is a straightforward translation of the static causal law into logic 

programming rule. 

We demonstrate the above translation by encoding the blocks world domain from 
Example 2.1. 

Example 3.1. The rules encoding the fluents and actions of the suitcase domain 
in Example 2.1 are: 

action{open{l\)) <— fluent{up{l\)) <— 

action{open{l2)) <— fluent{up{l2)) ^ 

action{close{li)) <— fluent{locked{s)) ^ 

action{close{l2)) <— fluent{holding{ki))) <— 

fluent {hoi ding {k2))) <— 

The first group of rules (left column) define the set A and the second group of rules 
(right column) define F. The dynamic law ca.uses{open{li) , up{li) , {}) is translated 
into the rule: 

holds{up{li),T + 1) ^ time{T),occ{open{h), T). 

The dynamic law causes(clo.se(/i), -'up{li), {}) is translated into the rule: 

holdsi^upih), T + 1) ^ Ume{T), occ{close{li), T). 

The cxccutability condition executable(open(Zi), {holding{k\)}) is translated into 
the rule: 

possihle{open{li),T) ^ time{T), holds{holding{ki),T). 
The static causal law caused{{up{li),up(l2)},~'locked{s)) is encoded by the rule: 

holds{-^locked{s),T) ^ time{T),holds{up{li),T),holds{up{l2),T). 
The static causal law caused({-iMp(li)}, locked{s)) is encoded by the rule: 

holds{locked{s), T) ^— time{T), holds{-^up{li),T). 
The encoding of other propositions of the domain is similar. □ 
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3.2 Domain independent rules 

The set of domain independent rules of 11 consists of rules for generating action 
occurrences and rules for defining auxiliary predicates. First, we present the rules 
for the generation of action occurrences. 

occiA,T) ^ action{A),time{T),pos.sible{A,T),notnocc{A,T). (12) 
nocc{A,T) ^ action{A),action{B),time{T), A ^ B,occ{B,T). (13) 

In the above rules, A and B are variables representing actions. These rules generate 
action occurrences, one at a time The rules of inertia (or the frame axioms) and 
rules defining literals are encoded using the following rules: 

Uteral{F) ^ fluent{F). (14) 

literali^F) ^ fluent{F). (15) 

contrary{F,-iF) ^ fluent{F). (16) 

contrary{-iF, F) <— fluent{F). (17) 

holds{L,T+l) ^ literal{L),literal{G),time{T), (18) 
contrary{L, G),holds{L, T), not holds{G, T+1). 

The first two rules define what is considered to be a literal. The next two rules say 
that -iF and F are contrary literals. The last rule says that if L holds at T and 
its contrary does not hold at T + 1 , then L continues to hold at T + 1 . Finally, to 
represent the fact that -^F and F cannot be true at the same time, the following 
constraint is added to 11. 

± ^ fluent{F),holds{F,T),holds{-nF,T). (19) 

3.3 Goal representation 

The goal A is encoded by two sets of rules. The first set of rules defines A as a 
formula over fluent literals and the second set of rules evaluates the truth value of 
A at different time moments. In a later section, we show how fluent formulas can 
be represented and evaluated. In this section, we will assume that A is simply a 
conjunction of literals, i.e., 

A = pi A . . . A pfe 
where pi are literals. Then, A is represented by the following rules: 

goal <— holds{pi, n), . . . , holds{pk, n). (20) 

(Recall that the constant n denotes the maximal length of trajectories that we are 
looking for.) 



These two rules can be replaced by the smodels cardinality constraint rule 

0{occ{A,T) : action{A)}l «- time{T) 

and a set of constraints that requires that actions can occur only when they are executable and 
when some actions are executable then one must occur. In many of our experiments, programs 
with these rules yield better performance. 
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3.4 Correctness of 11 

Let Iln{D, r, A) be the logic program consisting of 

— the set of rules encoding D and T (rulcH (8)-(ll)) in which the domain of T is 
{0, . . . , n} and the rules define actions and fluents of {D, F). 

— the set of domain-independent rules (rules (12)-(19)) in which the domain of T 
is {0,...,n}, 

— the rule in (20) and the constraint ± <— not goal{n) that encodes the requirement 

that A holds at n. 

In what follows, wc will write n„ instead of II,, {D. F, A) when it is clear from the 
context what D, F, and A arc. The following result (similar to the correspondence 
between histories and answer sets in [Lifschitz and Turner 1999]) shows the equiv- 
alence between trajectories achieving A and answer sets of n„. Before stating the 
theorem, we introduce the following notation: for an answer set M of n„, we define 

s,(M) = {/ I / is a fluent literal and holds{f, i) e M}. 

Theorem 3.2. For a planning problem {D, F, A) with a consistent action theory 
(AT), 

(i) if SqUo ■ ■ ■ a„„is„ is a trajectory achieving A, then there exists an answer set 
M of n„ such that 

(1) occ{ai, i) e M for i G {0, . . . , n — 1} and 

(2) Si = Si{M)hiiG{0,...,n}. 
and 

(ii) if M is an answer set of n„, then there exists an integer < k < n such 
that so{M)ao . . . ak-iSk{M) is a trajectory achieving A where occ{ai, i) G M for 
< z < fc and ii k < n then no action is executable in the state Sk{M). 

Proof. See Appendix A.l □ 

Note that the second item of the theorem implies that the trajectory achieving A 
corresponds to an answer set M of n„ that could be shorter than the predefined 

length n. This happens when the goal is reached with a shorter sequence of actions 

and no action is executable in the resulting state. 

Recall that the sequence of actions ao, ai, . . . , a„_i, where soaosi . . . a„_is„ is a tra- 
jectory achieving A, is not necessarily a plan achieving the goal A because the action 
theory (-D,r) may be non-deterministic. It is easy to see that whenever {D,T) is 
deterministic, if sooosi • • • On-iSn is a trajectory achieving A then ao, ai, . . . , a„_i 
is indeed a plan achieving A. The next corollary follows directly from Theorem 3.2. 

Corollary 3.3. For a planning problem {D, T, A) with a consistent and deter- 
ministic action theory {D,T), 

(1) for each plan oq, . . . , a„_i achieving A from F, there exists an answer set M of 
n„ such that occ{ai,i) S M for i e {0, . . . , n — 1}; and 

(2) for each answer set M of n„, there exists an integer < fc < n such that 
ao, . . . , a/s_i is a plan achieving A from F where occ{ai, i) e M for < i < fc 
and if fc < n then no action is executable in the state reached after executing 
ao, . . . , ttk-i in the initial state. 
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The first item of tlie corollary follows from Item (i) of Theorem 3.2. Nevertheless, we 
do not need to include the condition on the states Si{M) because Si{M) is uniquely 
determined by the initial state sq and the scqncncc of actions floi • • • The 
second item of the corollary follows from Item (ii) of Theorem 3.2. Again, because 
of the determinism of {D, F), we do not need to include the conditions on the states 
SiiM). 

4. CONTROL KNOWLEDGE AS CONSTRAINTS 

In this section, wc present the main contribution of this paper: augmenting the 
answer set planning program 11, introduced in the previous section, with different 
kinds of domain knowledge, namely temporal knowledge, procedural knowledge, 
and HTN-based knowledge. The domain knowledge acts as constraints on the 
answer sets of 11. For each kind of domain knowledge (also referred to as constraints) 
we introduce new constructs for its encoding and present a set of rules that check 
when a constraint is satisfied. Wc now proceed to introduce the different types of 
control knowledge. Wc start with temporal knowledge. 

4.1 Temporal Knowledge 

Use of temporal domain knowledge in planning was first proposed by Bacchus and 
Kabanza in [Bacchus and Kabanza 2000]. In their formulation, temporal knowledge 
is used to prune the search space while planning using forward search. In their 
paper, temporal constraints are specified using a future linear temporal logic with 
a precisely defined semantics. Since their representation is separate from the action 
and goal representation, it is easy to add them to (or remove them from) a planning 
problem. Planners exploiting temporal knowledge to control the search process 
have proven to be highly efficient and to scale up well [Bacchus et al. 2000]. In 
this paper, we represent temporal knowledge using temporal constraints. Temporal 
constraints are built from fluent formulae using the temporal operators always, 
until, next, and eventually, and a special goal operator goal. For simplicity 
of the presentation, we will write fluent formulae in prefix notation and use the 
propositional connectives and, or, and negation. Given a signature (O, AN, BN) 
we define term, formula, and closed formula, as follows. 

Definition 4.1. A term is a variable or a constant in O. 

Definition 4.2. A formula is either 

— an expression of the form /(ai, . . . , C7„) where / is a n-ary fluent name and each 

(Ti is a term; 

— an expression of the form and(0, where tj) and ^jJ are formulae; 
— an expression of the form or(^, •0), where (p and tp are formulae; 
— an expression of the form negation((/)), where (f) is a formula; or 
— an expression of the form (3X.{ci, . . . , c„}(^) or (VX.{ci, . . . , c„}0) where X is a 
variable, {ci, . . . , c„} is a set of constants in O, and (/> is a formula. 

We next define the notion of a closed formula. 

Definition 4.3. The formula over which a quantifier applies is called the scope 
of the quantifier. The scope of VX.{ci, . . . , c„} (resp. 3X.{ci, . . . , c„}) in 
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(VX.{ci, . . . , c„}(/>) (resp. (3X.{ci, . . . , c„}</))) is (j). An occurrence of a variable 
in a formula is a bounded occurrence iff the occurrence is within the scope of a 
quantifier which has the same variable immediately after the quantifier or is the 
occurrence of that quantifier. An occurrence of a variable in a formula is a free 
occurrence iff the occurrence is not bound. A variable is free in a formula if at least 
one of its occurrences is a free occurrence. 

Definition 4.4. A formula without free variables is called a closed formula. 

Remark 4.5. The truth or falsity of a formula is evaluated with respect to state in 
the standard way. It is easy to see that formulae with quantifiers can be translated 
into equivalent formulae without quantifier as follows: VX.{ci, Cn}0 is equiva- 
lent to Ar=i "^(cj) SLiid 3X.{ci, . . . , Cn}(j) is equivalent to Vr=i ^i^^i) where 4>{c.i) is 
the formula obtained from <j) by replacing every free occurrence oi X m. <j) with a. 
For this reason, we will be dealing with formulae without quantifiers hereafter. 

We are now ready to define the notion of a temporal constraint. 

Definition 4.6. A temporal constraint'^ either 

— a closed formula (Definition 4.4) 

— an expression of the form goal((/)) where is a closed formula; or 
— an expression of the form and((/>, V'), or{(j),tp), negation((/)), until(0, ■)/)), 
always((?i), eventually(0), or next(0) where (p and tjj are temporal constraints. 

A temporal constraint is an atomic constraint if it is a fluent literal. Otherwise, 

it is called non-atomic. In what follows, a constraint (f) will be referred as a sub- 
constraint of a constraint ip \i (f) occurs in ip. We will write sub{(j)) to denote the set 
of constraints consisting of (p and its sub-formulae. It is easy to see that constraints 
without temporal operators or the goal operator are indeed fluent formulae. Tem- 
poral operators are understood with their standard meaning while the goal operator 
goal provides a convenient way for expressing the control knowledge which depends 
on goal information. A temporal constraint is said to be goal-independent if no goal 
formula occurs in it. Otherwise, it is goal-dependent. Bacchus and Kabanza [Bac- 
chus and Kabanza 2000] observed that useful temporal knowledge in planning is 
often goal-dependent. In the blocks world domain, the following goal-dependent 
constraint^: 

always(and(goal(on(A:, tbl)), on{X, tbl)) D next(on(A:, tbl))) (21) 

can be used to express that if the goal is to have a block on the table and it is 
already on the table then it should be still on the table in the next moment of time. 
This has the effect of preventing the agent from superfluously picking up a block 
from the table if it is supposed to be on the table in the goal state. 

Notice that under this definition, temporal operators can be nested many times 
but the goal operator goal cannot be nested. For instance, if is a fluent formula, 
always(next(93)) is a temporal formula, but goaI(goaI(</3)) is not. 

^Because material implication (denoted by D) can be replaced by V and -i, we omit it in the 
definition but use it in writing the constraints, to simplify reading. As before, we use the convention 
that a formula with variables represents the set of its ground instantiations. 
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Goal-independent formulae will be interpreted over an infinite sequence of states of 
D, denoted by 7 = (sq, si, . . . , ). On the other hand, goal-dependent formulae will 
be evaluated with respect to a pair (/, (p) where 7 is a sequence of states and ip is 
a fluent formula. In the next two definitions, we formally define when a constraint 
is satisfied. Definition 4.7 deals with goal- independent constraints while Definition 
4.8 is concerned with general constraints. 

Definition 4.7. (Sec [Bacchus and Kabanza 2000]) Let 7 = {sq, si, . . . , s„, . . .) be 
a sequence of states of D. Let /i and /2 be goal-independent temporal constraints, 
t be a non- negative integer, and fs be a fluent formula. Let It = {sf, s^+i, . . . , ) 
denote the subsequence of 7 starting from St ■ We say that 7 satisfies / (/ is either 
/i, .f2, or /a), denoted by 7 |= /, iff 7o ^ / where 

—h 1= /3 iff St h /3. 

— It \= until(/i, /2) iff there exists t <t2 such that It^ \= h and for all t < ti < t2 
we have It^ \= /i. 

— 7t Hnext(/i) iSIt+i ^ fi. 

— It \= eventually (/i) iff there exists t <ti such that It^ \= fi- 
— It \= always(/i) iff for all t <ti we have It^ \= fi- 

For a finite sequence of states 7 = (sq, . . . , s„) and a goal-independent temporal 
constraint /, we say that 7 satisfies /, denoted by 7 ^ /, if 7' [= / where 7' = 

{so, ■ ■ ■ ,Sn,Sn, ■ ■ ■)■ Q 

Next we define when goal-dependent temporal constraints are satisfied by a se- 
quence of states and a goal. Intuitively, this should be a straightforward extension 
of the previous definition in which formulas of the form goal{ip) need to be ac- 
counted for. Obviously, such a constraint can only be evaluated with respect to 
a sequence of states and a formula encoding the goal. Furthermore, the intuition 
behind the formula goa\{tp) is that tjj is true whenever the goal is true, i.e., ip is 
entailed by the goal. This is detailed in the second item of the following definition. 

Definition 4.8. Let 7 = (sq, Si, . . . , s„, . . .) be a sequence of states of D and 
ip he a fluent formula denoting the goal. Let /i and /2 be temporal constraints 
(possibly goal dependent), i be a non-negative integer, and /s be a fluent formula. 
Let It = {st, St+i, ...,). We say that 7 satisfies / (/ is either /i, /2, or /s) with 
respect to ip, denoted by (7, ip) \= /, iff (7o, ip) \= f where 

-a*,^) h/siffst h/3. 
-(7t,^) hgoal(/3) iff^h/3 

— {It,<f) \= until(/i,/2) iff there exists t < such that {It^,ip) \= and for all 
t < ii < i2 we have {It^ , ip) \= fi. 

-{It,v) H next(/i) iff {It+u^) h /i- 

— (Iti'p) \= eventually(/i) iff there exists t < ti such that {Iti,ip) \= f\. 
— {It, if) \= always(/i) iff for aU t < ii we have {It^ ,ip) \= fi- 



i^Here, by |= /a we mean that ip entails fz. 
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For a finite sequence of states / = {sq, . . . ,Sn), a temporal constraint /, and a 
fluent formula (p we say that / satisfies / with respect to (p, denoted by (/, (p) \= f, 
if {!', </?)!=/ where /' = {sq, . . . , s„, s„, . . .). □ 

To complete the encoding of temporal constraints, we now provide the rules that 
check the satisfiability of a temporal constraint given a trajectory. We define 
the predicate hf{F,T) whose truth value determines whether F is satisfied by 
{st, St+1, ■ ■ ■ Sn), where st refers to the state corresponding to time point T. It 
is easy to see that rules for checking the satisfiability of temporal constraints can 
be straightforwardly developed in logic programming with function symbols. For 
example, the rules 

hf{L,T) ^ holds {L,T), liter al{L) 
hf(and{Fi,F2),T) ^ hf{Fi,T),hf{F2,T) 

can be used to determine whether or not the constraint and(Fi,F2) is true at the 
time moment T. The first rule is for atomic constraints and the second rule is for 
non-atomic ones. Although these rules are intuitive and correct, we will need to 
modify them for use with the currently available answer set solvers such as dlv and 
smodels. This is because dlv does not allow function symbols and Iparse - the 
parser of the smodels system - requires that variables occurring in the head of a 
rule are domain variables, i.e., in the second rule, we have to specify the domain of 
Fi and F^. 

We will now present two possible ways to deal with the answer set solver's restric- 
tion". The first way is to represent a constraint by a set of rules that determine 
its truth value. In other words, we specify the domains of Fi and F2 in the above 
rules by grounding them. For example, for the conjunction and(/, 51), the rules 

hf{L,T) ^ liter al{L),holds{L,T) 
hf{and{f,g),T) ^ hf{f,T),hf{g,T) 

can be used. For the disjunction, ov{f,and{g,h)), the rules 

hf{L,T) ^ literal{L),holds{L,T) 
hfior{f,andig,h)),T) ^ hf{!,T). 
hf{or{f,and{g,h)),T) hf{and{g,h),T). 
hf{and{g,h),T) ^ hf{g,T),hf{h,T). 

can be used. The encodings of other constraints are similar. Observe that the 
number of rules for encoding a formula depends on the number of its sub- constraints. 

An alternative to the above encoding is to assign names to non-atomic constraints, 
to define a new type, called formula, and to provide the constraint-independent 



'^^Another alternative for dealing with temporal constraints such as fluent formulae is 
to convert them into disjunctive normal form and to develop, for each conjunction 
and(/i,and(/2, . . ., and{/„_i, /„))), a rule 

/i/{and(/i, and(/2, . . . , and(/„_i, /„))), T) <- holds{fi,T), holds{f„,T). 

This method, however, cannot be easily extended for temporal constraints with temporal operators 
or the goal operator. 
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rules for checking the truth value of constraints. Atomic constraints are defined by 
the rule: 

formula{L) <— literal{L). 

For each non-atomic formula (j), we associate with it a unique name and encode it 
by a set of facts, denoted by r{(j)). This set is defined inductively over the structure 
of (f) as follows. 

— If (/) is a fluent literal I then r{(j)) = {^}; 

— If (j) = (pi A(p2 then r{(p) — r{(j)i) U r(02) U {formula{n^), and{n^, n^^,n^^)}] 
— If (l) = (p^y (j)2 then r{(t>) = r{(t>i) Ur{(j)2) U {formula{n^),or{n^,n^^,n^^)}; 
— If (j) = then r{(f)) = r(0i) U {formula{n^),negation(n^,n^-^)}\ 
— If (j) = next(0i) then r{(l)) = r{(j)) U {formula{n^),next{n^,n^^)}; 
— If (f> = until((/)i, (f)2) then 

r(0) = r(0i) U r{(j)2) U {formula{n^),until{n(i„n(f,^,n^^)}; 

— If (j) — always(0i) then r{(j)) — r(0i) U {formula{n^),always(n^,n^-^)}; 

— If (j) = eventually then r((/)) = r{(j)i) U {formula(n^), eventually {n^^ "0i)}- 

For simplicity, the names assigned to a constraint can be used in encoding other 
constraints. For example, the constraints (j) = and(/, acad{g, h)) is encoded by the 
atoms 

formula{n^). 
and{n^, g, h). 
formula(n^). 
and{n^,f, n^). 

We note that the above encodings can be generated automatically using a program 
front-end to smodels that is available on the web-site containing the experimental 
results presented in this paper. Note that during the grounding phase of smodels 
(by Iparse), atoms of the form formula{., .) will be removed. For this reason, we 
use the second encoding in our experiments because it is easier to deal with changes 
in the constraints used for encoding the control knowledge. 

We now present the formula-independent rules for evaluating temporal con- 
straints. As with defining the satisfaction of temporal constraints, we first consider 
goal-independent temporal constraints. The rules needed for evaluating temporal 
constraints whose first level operator is different than the goal operator are as 
follows: 



hf{L,T) ^ liter al{L),holds{L,T). (22) 

hf{N,T) ^ formula{N),and{N,Ni,N2), (23) 

hf{Ni,T),hfiN2,T). (24) 

hf{N,T) ^ formula{N),oriN,NuN2),hf{Ni,T). (25) 

hf{N,T) ^ formula{N),or{N,Ni,N2),hf{N2,T). (26) 
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hf{N,T) ^ 


formula(N), neqationlN . N-\ ] ], not hf(N-\ , T). 


(27) 


hf{N,T) ^ 


formula{N), unUl{N, Ni, N2), 


(28) 




T < T', hf.during{Ni, T, T'), hf{N2, T'). 




hf(N,T) ^ 


formula(N), alwavs(N, N-\), 


(29) 




hf_during{Ni, T, n). 




hf{N,T) ^ 


formula{N), eventually{N, Ni), 


(30) 




hf{Ni,T'),T<T'. 




hf{N,T) ^ 


formula{N) , next{N, Ni),hf{Ni,T+l). 


(31) 


hf_during{N,T,T) ^ 


hf{N,T). 


(32) 


hf.during{N,T,T') ^ 


hf{N, T), T < T', hf_during{N, T + 1, T'). 


(33) 



The meaning of these rules is straightforward. The first rule defines the truth value 
of an atomic formula (a literal). Rule (23) says that a conjunction holds if its 
conjuncts hold. Rules (25)- (26) say that a disjunction holds if one of its disjuncts 
holds. The rule (27) states that the negation of a formula holds if its negation does 
not hold. Its correctness is due to the assumption that initial states are complete. 
Rules (28)-(33) deal with formulae containing temporal operators. The constant n 
denotes the maximal length of trajectories that we are looking for. In the following, 
we refer to this group of rules by IT formula ■ 

The next theorem shows that rules (22)-(33) correctly implement the semantics of 
goal-independent temporal formulae. 

Theorem 4.9. Let 6" be a finite set of goal-independent temporal formulae, 
/ = (soj Si • • • Sn) be a sequence of states, and 

^formula{S, I) = ^formula U r(7) U r{S) 

where 

— r{S) is the set of atoms used in encoding S, and 

— r{I) = Llf^Q{holds{l,t) | Hs a fluent literal and I G St}. 

Then, 

(i) The program Tiformuia{S, I) has a unique answer set, X. 

(ii) For every temporal formula 4> such that formula{n^) G r{S), (j) is true in It, 
i.e.. It ^ (p, if and only if hf{n^, t) belongs to X where It = {st, ■ ■ ■ Sn)- 



Proof. See Appendix A.2 □ 

Having defined temporal constraints and specified when they are satisfied, adding 
temporal knowledge to a planning problem in answer set planning is easy. We 
must encode the knowledge as a temporal formula^^ and then add the set of rules 
representing this formula and the rules (22)-(33) to 11. Finally, we need to add the 

^^A set of temporal formulcie can be viewed as a conjunction of temporal formulae. 
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constraint that requires that the goal is true at the final state and the temporal 
formula is satisfied. More precisely, for a planning problem {D, T, A) and a goal- 
independent temporal formula ^, let H^^^ be the program consisting of 

— the program n„ (Defined as in Sub-section 3.4), 
—the rules (22)- (33) 

— the rules encoding (p and the constraint ± ^ not hf{n^, 0). 

The next theorem is about the correctness of 11^^^. 

Theorem 4.10. For a planning problem {D,r, A) with a consistent action the- 
ory {D, r) and a goal-independent temporal formula ^, 

(i) if sqUo ■ ■ ■ ttn-iSn is a trajectory achieving A and I \= (f) where / = (sq, . . . , Sn), 
then there exists an answer set M of H^^^ such that 

(1) occ{ai, i) € M iov i € {0, . . . ,n - 1}, 

(2) Si = Si{M) for i e {0, . . . , n}, and 

(3) hf{n^,0)GM. 
and 

(ii) if M is an answer set of H^^^, then there exists an integer < fc < n such 

that 

(1) so{M)ao . . . ak-iSk{M) is a trajectory achieving A where occ{ai, i) G M for 
<i < k and 

(2) I\=(l) where / = (so(M), . . . , s„(M)). 

Proof. Observe that the set of literals of the program n„, lit{Iln), is a splitting 
set of the program H^^^ and n„ = 6(jj(nn)(n^^^). Thus, M is an answer set 
of n^^^ iS M = X UY where X is an answer set of n„ and Y is an answer 
set of e;jj(n„) (n^^'^ \ Iln,X) which consists of the set of rules (23)-(33), the set 
of atoms {hf{l,t) \ holds{l,t) € X}, the rules encoding 0, and the constraint 
_L <— not hf{n^,0). This constraint implies that hf{n^,0) must belong to every 
answer set M of H^^^. 

We now prove (i). It follows from Theorem 3.2 that there exists an answer set X 
of n„ such that the first two conditions are satisfied. Because I \= (j), we can apply 
Theorem 4.9 to show that any answer set Y of eiit(u„)(^n^^ \ ^n,X) contains 
hf{n^, 0). Thus, X U y is an answer set satisfying (i). 

To prove (ii), it is enough to notice that the answer set X of n„, constructed in the 
proof of Lemma .4, can be used to construct an answer set M of H^^^ such that 
M satisfies (ii). □ 

The above theorem shows how control knowledge represented as goal-independent 
temporal formulae can be exploited in answer set planning. We will now extend 
this result to allow control knowledge expressed using goal-dependent temporal 
formulae. Based on Definition 4.8, where satisfaction of goal-dependent temporal 
formulae is defined, we will need to encode A \= ij; where A is the goal and goal(^/') 
is a formula occurring in a control knowledge that we wish to use. To simplify this 
encoding we make the same assumption that is made in most classical planning 
literature including [Bacchus and Kabanza 2000]: the goal A in a planning problem 
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{D, r, A) is a set of literals and each goal formula occurring in a temporal formula 
representing our control knowledge is of the form goal(F) where F is a fluent 
literal. In the rest of this section, whenever we refer to a planning problem or a 
goal-dependent temporal formula we assume that they satisfy this assumption. Let 
{D, r, A) be a planning problem and ^ be a temporal formula. n^^^+Goai ^j^g 
program consisting of H^^^, the set of atoms {fomula{ngoali) \ goal(Z) is a goal 
formula occurring in <j), and the set of rules 

hf{ngoaif,T) ^ time{T) (34) 

for each f € A. Intuitively, these rules assert that / is a part of the goal A. The 
next theorem is about the correctness of 11^^^+'^''"'. 

Theorem 4.11. For a planning problem {D, F, A) with a consistent action the- 
ory {D, r) and a temporal formula c^, 

(i) if soflo • ■ • a„_is„ is a trajectory achieving A and (/, A) |= (j) where I = 
(sq, . . . , s„), then there exists an answer set M of li^^P+Goai g^gj^ ^j^^^ 

(1) occ{ai, e M for i e {0, . . . , n - 1}, 

(2) Si = Si{M) for i e {0, . . . , n}, and 

(3) /i/(n^,0) gM. 
and 

(ii) if M is an answer set of 11^''^'''+'^°"', then there exists an integer < A; < n 
such that 

(1) so(-M")ao . . . ak-iSk{M) is a trajectory achieving A where occ{ai,i) G M for 
< i < fc and 

(2) {I, A)\=4> where 7 = (so(M), . . . , s„(M)). 

Proof. To prove this theorem, we first need to modify Theorem 4.9 by (i) 
allowing goal-dependent formulae to be in the sot S: (ii) adding a goal A and the 
rule (34) to the program II of Theorem 4.9. The proof of this modified theorem is 
very similar to the proof of Theorem 4.9. We omit it here for brevity. This result, 
together with Theorem 3.2, proves the conclusion of this theorem. □ 

4.2 Procedural Knowledge 

Procedural knowledge can be thought of as an under-specified sketch of the plans to 
be generated. The language constructs that we use in this paper to describe proce- 
dural knowledge are inspired by GOLOG, an Algol-like logic programming language 
for agent programming, control and execution; and based on a situation calculus 
theory of action [Levesque ct al. 1997]. GOLOG has primarily been used as a 
programming language for high-level agent control in dynamical environments (see 
e.g. [Burgard et al. 1998]). Although a planner can itself be written as a GOLOG 
program (See Chapter 10 of [Reiter 2000]), in this paper, we view a GOLOG pro- 
gram as an incompletely specified plan (or as a form of procedural knowledge) that 
includes non-deterministic choice points that are filled in by the planner. For ex- 
ample, the procedural knowledge (which is very similar to a GOLOG program) 
di', (I2; (asla^la^); f represents plans which have ai followed by 02, followed by one 
of as, 04, or 05 such that / is true in the following (terminating) state of the plan. 
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A planner, when given this procedural knowledge needs only to decide which one 
of as, a4, or it should choose as its third action. 

We now formally define the syntax of our procedural knowledge, which - keeping 
with the GOLOG terminology - we refer to as a program. A program is built from 
complex actions and procedures. Complex actions, procedures, and programs are 
constructed using variables, actions, formulae, and procedural program constructs 
such as sequence, if-then-else, while-do, or choice, etc. They are defined as 
follows. 

Definition 4.12. A complex action 6 with a sequence of variables Xi, . . . ,Xn is 

— a basic complex action: 

— an expression of the form a{ai, . . . , am) where a is an m-ary action name, crj 

is either a variable or a constant of the type U, and if ai is a variable then it 

belongs to {Xi, . . . , X„} or 
— an expression of the form (f) where is a formula whose free variables are from 

{Xi, . . . , Xn}; 

— a sequence: an expression of the form Si ; 62 where 61 and 62 are complex actions 

whose free variables are from {Xi, . . . , Xn}; 
— a choice of actions: an expression of the form 61 \ ... \ 5k where Jj's are complex 

actions whose free variables are from {Xi, . . . , X„}; 

— a if-then-else: an expression of the form if cp then di else 62 where </) is a formula 
and 5i and 62 are complex actions whose free variables are from {Xi, . . . 

— a while-do: an expression of the form while (j> do Si where is a formula and i5i 
is a complex action whose free variables arc from {Xi, . . . , Xn}', 

— a choice of arguments: an expression of the form pick(y, {ci, . . . , c„}. Si) where 
Y ^ {Xi, . . . , Xn}, {ci, . . . , Cn} is a set of constants, and i5i is a complex action 
whose free variables are from {Xi, . . . , Xn, Y}; and 

— a procedure call: an expression of the form p{Xi, . . . , X„) where p is a procedure 
name whose variables are Xi, . . . , Xn- 

Definition 4.13. A procedure with the name p and a sequence of variables 
Xi, . . . , Xn is of the form {p{Xi , Xn) : S) where 5, called the body, is a complex 
action whose free variables are from {Xi, . . . , Xn}- 

A procedure {p{Xi, . . . , Xn) : S) is called a nested procedure if 5 is a procedure call. 

Intuitively, a complex action S represents a sketch of a plan whose variations are 
given by its variables and its structure. The execution of 5 is done recursively over 
its structure and starts with the instantiation of Xi, . . . , Xn with some constants 
ci, . . . , c„. In the process, an action might be executed, a formula might be evalu- 
ated, other complex actions or procedures might be instantiated and executed. In 
other words, the execution of S might depend on the execution of other complex 
actions. Let 5 he a. complex action with variables {Xi, . . . and ci, . . . ,c„ be 
constants. In the following, we define 

— the ground instance of S with respect to the substitution {Xi/ci, . . . ,Xn/cn}, 
denoted by J(ci, . . . , c„), and 
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— the set of complex actions that the execution of (5(ci, . . . , c„) might depend on, 
denoted by prim{S{ci , . . . , c„)). 

5{ci, . . . , Cn) and prim{S{ci, . . . , c„)) are defined recursively as follows: 

— if S ~ a((7i, . . . , (Jm), then S{ci, . . . , c„) is the action a{c'i, . . . , c'^) where = Cj 
if CTi is the variable Xj and c'- = ai if CTj is a constant and prim{5{ci, . . . , c„)) = 
{a{c[,...,c'^)}, 

— if S = (j) then S{ci, . . . , c„) = <^(ci, . . . , c„) where 0(ci, . . . , Cn) is obtained 

from by simultaneously replacing every free occurrence of Xi in (p by Cj and 

prini{6{ci, c„)) = {0(ci, . . . , c„)}, 
— if 6 = Si;62 then 6{ci, . . . ,c„) = ^i(ci, . . . ,c„); (52(ci, . . . ,c„) and 

prim{S{ci, . . . , c„)) = prim((5i(ci, . . . , c„)) U prjm((52(ci, . . . , c„)), 
— if S = Si \ . . . \ Sk then (5(ci, . . . ,c„) = (5i(ci, . . . , c„) | . . . | 5fe(ci, • • • ,c„) and 

prim((5(ci,...,c„)) = Ui=iP^^"^(<^i(ci, • • • ,c„)), 
— if 5 = if <^ then i5i else S2 then 

6{ci, . . . ,Cn) = a 4>{ci, . . . ,c„) then (5i(ci, . . . , c„) else ^2(01, . . . , c„) and 

prim{6{ci, . . . ,Cn)) = {(/)(ci, . . . , c„)} U y»rim((5i(ci, . . . , c„)) U 

prim{d2{ci, . . . ,c„)), 
— if S — while (f) do 5i then 5(ci,...,c„) = 

while 0(ci, . . . , c„) do (5i(ci, . . . , c„) and 

prim{S{ci, . . . , c„)) = {«()(ci, . . . , c„)} U prim{52{ci, c„)), 
—if (5 = pick(r, {j/i,...,t/„},5i) then ^(ci,...,c„) = 5i(ci, . . . ,c„,yj) for some j, 

1 < j < m, and 

prim((5(ci,...,c„)) = {prim((5i(ci, . . . , c„, y^)) | j = l,...,m}; and 
— If S = p{Xi,...,Xn) where {p{Xi, . . . ,Xn) : 5i) is a procedure then 
5(ci,...,c„) = Si{ci,...,Cn) and prim{S{ci,...,Cn)) = {p(ci, . . . , c„)} U 

prim{5{ci, . . . ,c„)). 

A ground instance of a procedure {p(Xi, . . . , X„) : 6) is of the form {p{ci, . . . , c„) : 
S{ci, . . . , Cn)) where ci, . . . , c„ are constants and 6{ci, . . . , c„) is a ground instance 
of 5. 

In what follows, S{ci, . . . , c„) (resp. (p(ci, . . . , c„) : 5(ci, . . . , c„))) will be referred 
to as a ground complex action (resp. ground procedure). As with complex actions, 
for a procedure {p{Xi, . . . , Xn) : 6) and the constants ci, . . . , c„, we define the set of 
actions that the execution of p{ci , . . . , c„) might depend on by prim{p{ci , . . . , c„)) = 
prim{S{ci, . . . , c„)). It is easy to see that under the above definitions, a procedure 
p may depend on itself. For example, for two procedures "(p : while do q)" 
and "(g : while 02 do p)", we have that prim{p) = {p, 5, ^i, </'2} and prim{q) = 
{p,q,(j)i,(p2}- Intuitively, this will mean that the execution of p (and q) might 
be infinite. Since our goal is to use programs, represented as a set of procedures 
and a ground complex action, to construct plans of finite length, procedures that 
depend on themselves will not be helpful. For this reason, we define a notion called 
well-defined procedures and limit ourselves to this type of procedure hereafter. We 
say that a procedure p with variables Xi, . . . , Xn is well-defined if there exists no 
sequence of constants ci, . . . , c„ such that p{ci, . . . , c„) G prim{p{ci, . . . , c„)). We 
will limit ourselves to sets of procedures in which no two procedures have the same 
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name and every procedure is well-defined and is not a nested procedure. We call 
such a set of procedures coherent and define programs as follows. 

Definition 4.14 Program. A program is a pair (5, 6) where 5 is a coherent set of 
procedures and 5 is a ground instantiation of a complex action. 

We illustrate the above definition with the following example. 

Example 4.15. In this example, we introduce the elevator domain from 
[Levesque et al. 1997] which we use in our initial experiments (Section 4.4). The 
set of constants in this domain consists of integers between and k representing 
the floor numbers controlled by the elevator. The fluents in this domain and their 
intuitive meaning are as follows: 

— on{N) - the request service light of the floor N is on, indicating a service is 
requested at the floor A'^, 

— opened - the door of the elevator is open, and 

— currentFloor{N) - the elevator is currently at the floor N. 

The actions in this domain and their intuitive meaning are as follows: 

— up{N) - move up to floor N, 

— down{N) - move down to floor N, 

— turnof f{N) - turn off the indicator light of the floor N, 

— open - open the elevator door, and 
— close - close the elevator door. 

The domain description is as follows: 

causes(up(A'"), currentFloor{N), {}) 
ca.uses{down{N) , currentFloor{N) , {}) 
causes{turnoff{N), ^on{N), {}) 
causes(open, opened, {}) 
causes(dose, ^opened, {}) 
_ I caused{{currentFloor{M)},^currentFloor{N)) for all M ^ N 
^elevator - < executable(up(A'), {currentFlooriM), ^opened}) for all M < A 

executable(down(iV), {currentFloor{M), ^opened}) for all M > A 
executable(torno//(A), {currentFloor{N)}) 
executable(open, {}) 
executable(dose, {}) 
executable(nuZZ, {}) 

We consider arbitrary initial states where opened is false, currentFloor{N) is true 
for a particular N and a set of on{N) is true; and our goal is to have ^on{N) for all 
N . In planning to achieve such a goal, we can use the following set of procedural 
domain knowledge. Alternatively, in the terminology of GOLOG, we can say that 
the following set of procedures, together with the ground complex action control, 
can be used to control the elevator, so as to satisfy service requests - indicated by 
the light being on - at different floors. That is, the program for controlling the 
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elevator is (S*, control) where 



S= i 



{go.floor{N) 
{serve{N) 
{serveM-floor 
{park 
{control 



currentFloor{N)\up{N)\down{N)) 

go-floor{N);turnoff{N); open; close) 

pick(iV, {0, . . . , A;}, {on{N); serve{N))). 

if currentFloor{0) then open else [down{0);open]) 

[while 3A''.{0, . . . ,k} [on{N)] do serve-a-floor];park). 



Observe that the formula 3N.d{N) [on{N)], as discussed before, is the shorthand 
of the disjunction 

or(on(0), or(on(l), . . . , or{on{k — 1), on{k)))) 

where 0, . . . , fc are the floor constants of the domain. □ 

The operational semantics of programs specifies when a trajectory 
soflO'Si • • • Or)-i.s,i, denoted by a, is a trace of a program {S,6). Intuitively, 
if a is a trace of a program {S, 5) then that means oq, . . . , a„_i is a sequence of 
actions (and a is a corresponding trajectory) that is consistent with the sketch 
provided by the complex action S of the program {S, 6) starting from the initial 
state So- Alternatively, it can be thought of as the program {S, d) unfolding to the 
sequence of actions ao, . . . , a„_i in state so- We now formally define the notion of 
a trace. 

Definition 4.16 Trace. Let p = {S,S) be a program. We say that a trajectory 
sottosi . . . a„_is„ is a trace of p if one of the following conditions is satisfied: 

— 6 = a and a is an action, n = 1 and ao = a; 
— 6 = (f), n = and (p holds in sq; 

— 6 = Si] 62, and there exists an i such that Soao...s, is a trace of {S,5i) and 

trace of {S, S2); 

— 15 = (5i I . . . I Sn, and soao . - - a„_is„ is a trace of {S, Si) for some i; 

— 6 = if then else 62, and soOq . . . a„_is„ is a trace of {S, 61) if ^ holds in 

So or Sotto - • - ^n—l'^n 

is a trace of {S, S2) if -^(p holds in sq; 
— 6 = while (j) do di, n = and -i0 holds in sq, or 

(p holds in sq and there exists some i > such that soao - - - Sj is a trace of {S, di) 

and trace of {S. S); or 

— 6 = pick(F, {yi, ym}, Si) and soaoSi . . . a„_is„ is a trace of {S, Si{yj)) for 

some j, I < j < m. 
— 5 = p{ci, . . . , c„) where {p{Xi, . . . , X„) : Si) is a procedure, and 

soaoSi . . . a„-iSn is a trace of {S, Si{ci, . . . , c„)). 

□ 

The above definition allows us to determine whether a trajectory a constitutes a 
trace of a program {S,5). This process is done recursively over the structure of S. 
More precisely, if S is not an action or a formula, checking whether a is a trace of 
{S, 6) amounts to determining whether a is a trace of {S, S') for some component S' 
of 5. We note that because S is grounded, S' is also a ground complex action; thus, 
guaranteeing that {S, S') is a program and hence the applicability of the definition. 
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It is easy to see that 5' belongs to prim{d). Because of the coherency of S and the 
finiteness of the domains, this process will eventually stop. 

We will now present the smodels encoding for programs. The encoding of a pro- 
gram (5, S) will include the encoding of all procedures in S and the encoding of 
6. The encoding of a complex action or a procedure consists of the encoding of all 
of its ground instances. Similar to the encoding of formulae, each complex action 
S will be assigned a distinguished name, denoted by ng, whenever it is necessary. 
Because procedure names are unique in a program, we assign the name p(ci, . . . , Cn) 
to the complex action 6{ci, . . . , c„) where {p{Xi, . . . , X„) : 5) is a procedure and 
ci, . . . , c„ is a sequence of constants. In other words, ns(^ci,...,c„) = p{ci, • • • , Cn). 
We note that since the body of a procedure is not a procedure call, this will not 
cause any inconsistency in the naming of complex actions. We now describe the set 
of rules encoding a complex action S, denoted by r{S), which is defined inductively 
over the structure of 5 as as follows: 

— For d = a ov d = 4>, r{S) is the action a or the rules encoding (j), respectively. 

— For 5 = 6i;62, r{5) = {sequence{ns,nsi,ns2)} U r{Si) U r{S2)- 

—For S ^ Si \ 52 ■ ■ ■ \ Sn, r{S) = Ui=i,...,n''(^i) U {m(n5,,n5)|i = l,...,n}U 

{choice Action{ns)} . 

— For (5 = if then (5i else 62, r{S) = r{4>)Lir{6i)Ur{S2)l-i{if{ns,n^,ns^,ns2)}. 
— For 5 = while do Si, r{S) = r{^) U r{Si) U {'while{ns,n^,nsi)}. 
—For S = pick(F, {yi,..., Vm}, Si), 

j—m 

r{S)= y r{Siiyj))UR 

where R = {choiceArgs{ns,ns^(^y.-^) \ j = 1,. . . ,m)}. 
— For S = p{ci, . . . , Cn) where {p{Xi, . . . , X„) : ^i) is a procedure, r{S) = {S}. 

A procedure {p{Xi, .... Xn) : Si) is encoded by the set of rules encoding the collec- 
tion of its ground instances. The encoding of a program {S, S) consists of r{S) and 
the rules encoding the procedures in S. Observe that because of 6"s coherence, the 
set of rules encoding a program {S, S) is uniquely determined. 

Example 4.17. In this example we present the encoding of the program 

(S, control) from Example 4.15. 

We start with the set of rules encoding the ground procedure {go_floor{i) : 
currentFloor{i) | up{i) \ down{i)) where i is a floor constant. First, we assign 
the name go-floor{i) to the complex action currentFloor{i) \ up(i) \ down{i) and 
encode this complex action by the set r{go-floor{i)). This set consists of the fol- 
lowing facts: 

choice Action{go-f loor{i)) . 
in{currentFl oor (i ) , go_ floor [i)). 
in{up{i), go-floor{i)). 
in{down{i), go-floor{i)). 

Similar atoms are needed to encode other instances of the procedure go-floor{N). 
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For each floor i, the following facts encode the complex action 
go-floor{i),turnoff{i), open, close, which is the body of a ground instance 
of the procedure {serve{N) : go-floor{N),turnoff{N), open, close): 

sequence{serve(i) , go_floor{i), serve J,ailA(i)) . 
sequence{serveJ,ailA (i), turnof f{i) , openjzlose) . 
sequence{openjdose, open, dose). 

To encode the procedure {servc-U- floor : pick(A'', {0, 1, . . . , fc}, {on{N); serve{N))), 
we need the set of rules which encode serve{i), < i < k, (above) and the rules 
encode the complex action on(i): serve{i) for every i. These rules are: 

sequence{body-serve-a_floor{i), on{i), serve{i)), 

where body serve ji-floor{i) is the name assigned to the complex action 
on{i); serve{i), and the following rule: 

choice Args{serve-a-floor, body_serve_a_floor{i)). 

The following facts encode the procedure {park : 
if currentFloor{0) then open else [down; park]): 

if {park, currentFloor{0) , open,park_l). 
sequence{parkA, down{0), open). 

Finally, the encoding of the procedure control consists of the rules encoding the 
formula 

or(on(0), or(on(l), . . . , or{on{k — 1), on(fc)))) 

which is assigned the name 'existOn' and the following rules: 

sequence{control, whileservicejneeded, park) . 
while{whileservice-needed, existOn, serve ja. floor). 

□ 

We now present the AnsProlog rules that realize the operational semantics of pro- 
grams. We define a predicate trans {P, Ti, T2) where P is a program and Ti and T2 
are two time points, Ti < T2. Intuitively, we would like to have trans{p,t\,t2) be 
true in an answer set M iff (M)ati • • • (it2-i.st2 {M) is a trace of the program p^^. 



trans{A,T,T +1) ^ action{A),occ{A,T). (35) 

trans{F.Ti.Ti) ^ formula{F),hf{F,Ti). (36) 

trans{P, Ti, T2) ^ sequence{P, Pi, P2), TI < T' < Ta, (37) 

trans{Pi,Ti,T'), trans{P2, T' , T2). (38) 

trans{N,Ti,T2) ^ choice Action{N), (39) 

m(Pi , N) , trans{Pi , Ti , T2) . 

trans{I,T,,T2) ^ if{I,F,P,,P2),hf{F,T,),trans{P,,T,,T2). (40) 



trans{I,Ti,T2) ^ if{I,F,Pi,P2),nothf{F,Ti),trans{P2,Ti,T2). (41) 

Recall that we define Si{M) = {holds{f,i) £ M | / is a fluent literal}. 
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trans{W,Ti,T2) ^ while{W,F,P),hf{F,Ti),Ti < T' < T2 
trans{P, Ti,T'),trans{W, T', T2). 



(42) 



trans{W,T,T) ^ while{W,F,P),not hf{F,T). 
trans{S,Ti,T2) ^ choiceArgs{S, P),trans{P,Ti,T2). 
trans{nun,T,T) ^ 



(43) 
(44) 
(45) 



Here null denotes a dummy program that performs no action. This action is 
added to allow programs of the form if (f then p to be considered (this will 
be represented as if (p then p else null). The rules are used for determining 
whether a trajectory - encoded by answer sets of the program n„ - is a trace of 
a program or not. As with temporal constraints, this is done inductively over the 
structure of programs. The rules (35) and (36) are for programs consisting of an 
action and a fluent formula respectively. The other rules arc for the remaining cases. 
For instance, the rule (42) states that the trajectory from Ti to T2 is a trace of a 
while loop "while F do P", named W and encoded by the atom while{W, F, P), 
if the formula F holds at Ti and there exists some T' , Ti < T' < T2 such that the 
trajectory from Ti to T' is a trace of P and the trajectory from T' to T2 is a trace 
of W; and the rule (43) states that the trajectory from T to T is a trace of W if the 
formula F docs not holds at T. These two rules effectively determine whether the 
trajectory from Ti to T2 is a trace of while{W, F, P). The meanings of the other 
rules are similar. 

Observe that wc do not have specific rules for complex actions which are proce- 
dure calls. This is because of every trace of a procedure call p{ci, . . . ,c„), where 
p{Xi, . . . , Xn) : (5) is a procedure, is a trace of the complex action (5(ci, . . . , c„) — 
whose name is p{ci, . . . , c„), as described earlier — and vice versa. Furthermore, S 
is not a procedure call , traces of Si can be computed using the above rules. The 
correctness of the above set of rules (see Theorem 4.18) means that procedure calls 
are treated correctly in our implementation. 

To specify that a plan of length n starting from an initial state must obey the sketch 
specified by a program p = {S, S), all we need to do is to add the rules encoding p 
and the constraint •*— not trans{np, 0, n) to n„. We now formulate the correctness 
of our above encoding of procedural knowledge given as programs, and relate the 
traces of program with the answer sets of its AnsProlog encoding. Let 11^°'°^ be 
the program obtained from n„ by (i) adding the rules (35)-(45) and (22)-(33), (ii) 
adding r{p), and (iii) replacing the goal constraint with _L <— not trans{np,0,n). 
The following theorem is similar to Theorem 3.2. 

Theorem 4.18. Let {D,T) be a consistent action theory and p = {S,6) be a 
program. Then, 

(i) for every answer set M of n^°'°f with occ{ai,i) G M for i e {0, . . . , n — 1}, 
so(M)ao . . . a„_is„(M) is a trace of p; and 

(ii) if Soflo • ■ • a„_is„ is a trace of p then there exists an answer set M of 11^°'°^' 
such that Sj = Sj (M) and occ{ai, i) G M for j G {0, . . . , n} and i € {0, . . . , n — 




□ 
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To do planning using procedural constraints all we need to do is to add the goal 
constraint to 11^°'°^, which will filter out all answer sets where the goal is not 
satisfied in time point n, and at the same time will use the sketch provided by the 

program p. 

4.3 HTN-Based Knowledge 

The programs in the previous subsections are good for representing procedural 
knowledge but prove cumbersome for encoding partial-ordering information. For 
example, to represent that any sequence containing the n programs pi, . . . in 
which pi occurs before ^2, is a valid plan for a goal A, one would need to list all 
the possible sequences and then use the non-determinism construct. For n = 3, the 
program fragment would be {pi',P2'tP3\pi]Pz]P2\pz]Pi]P2)- Alternatively, the use of 
the concurrent construct || from [Dc Giacomo et al. 2000], where p\\q represents the 
set consisting of two programs p\ q and g; p, is not very helpful either. This deficiency 
of pure procedural constructs of the type discussed in the previous section prompted 
us to look at the constructs in HTN planning [Sacerdoti 1974]. The partial-ordering 
information allowed in HTN descriptions serves the purpose. Thus all we need is 
to add constraints that says pi must occur before P2- 

The constructs in HTN by themselves are not expressive enough either as they 
do not have procedural constructs such as procedures, conditionals, or loops, and 
expressing a while loop using pure HTN constructs is not trivial. Thus we decided 
to combine the HTN and procedural constructs and to go further than the initial 
attempt in [Baral and Son 1999] where complex programs are not allowed to occur 
within HTN programs. 

We now define a more general notion of program that allows both procedural and 
HTN constructs. For that we need the following notion. Let = {{pi : 5\), . . . ,{pk 
5k)} be a set of procedures with free variables {^i, . . . , Xn}- 

— An ordering constraint over S has the form pi -< pj where Pi ^ pj. 
— A truth constraint is of the form {pi,4>), {<f>TPi); or {pi, (f),Pt), where is a formula 
whose free variables are from the set {Xi, . . . , X^}- 

Given a set of procedures S and a set of constraints C over S, the execution of S will 
begin with the grounding of S and C, i.e., the variables Xi, . . . , Xn are substituted 
by some; constants ci, . . . , c„. The constraints in C stipulate an order in which the 
procedures in S is executed. The intuition behind these types of constraints is as 
follows: 

— An ordering constraint pi -< pj requires that the procedure pi has to be executed 
before the procedure pj . 

— A truth constraint of the form {pi,(j)) (resp. {(i),pi)) requires that immediately 
after (resp. immediately before) the execution oi pi, (p must hold. 

— A constraint of the form {pi,(j),pt) indicates that (j) must hold immediately after 
Pi is executed until pt begins its execution. 

Because a constraint of the form {pi,(j),Pt) implicitly requires that pi is executed 
before pt, for convenience, we will assume hereafter that whenever {pi ,4',pt ) belongs 
to C, so does Pi -< Pt- 
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The definition of general complex actions follows. 

Definition 4.19 General Complex Action. For an action theory (£), F), a general 
complex action with variables Xi, . . . , X„ is either 

— a complex action (Definition 4.12); or 

— a pair (S, C) where E is a set of procedures and C is a set of constraints over S 
and the variables of each procedure in S are from Xi, . . . , X„. 

The definition of a procedure or program does not change. The notion of ground 
instantiation, dependency, and well-definedness of a procedure can be extended 
straightforwardly to general programs. We will continue to assume that programs 
in consideration are well-defined. As in the case of programs, the operational seman- 
tics of general programs is defined using the notion of trace. In the next definition, 
we extend the notion of a trace to cover the case of general programs. 

Definition 4.20 Trace of general programs. Let p — (S, 5) be a general program. 
We say that a trajectory SqUq . . . a„_is„ is a trace of p if one of the following 
conditions is satisfied: 

— soflo • • • dn-iSn and {S, 5) satisfy one of the condition in Definition 4.16; or 

—6 = (E, C), E = {{pi : Si), {pk, 4)}, and there exists jo=0 < ji < • • • < jk=n 
and a permutation (zi, . . . , ij.) of (1, . . . , fc) such that the sequence of trajectories 
tti = Soao-.-Sji, a2 = sj-^aj-^ . . . sj^, . . ., ak = Sj^_-^^aj^_^...Sn satisfies the 
following conditions: 

(1) for each I, 1 <l <k, ai \s a, trace of {S, 5^), 

(2) lint ~< ni & C then it < ii, 

(3) if {(f), ui) E C (or {rii, (p) e C) then (f) holds in the state Sj,_i (or Sj,), and 

(4) if (nt, 0, rii) S C then (j) holds in sj^,. . . , Sj^_^ . 

□ 

The last item of the above definition can be visualized by the following illustration: 



"1 °2 a, cjj. 

T T T T 

II I I 

trace of (S,<5ij) trace of (Sjij^) trace of (S,(5ij) trace of (S,(5ij^) 

Next we show how to represent general programs. Similar to programs in the 
previous section, we will assign names to general programs and their elements. A 
general program p = {S, C) is encoded by the set of atoms and rules 

r(jj) = {htn{np, ns, nc)} U r{S) U r(C) 

where r{S) and r(C) is the set of atoms and rules encoding S and C and is described 
below. Recall that 5 is a set of programs and C is a set of constraints. Both S and C 
are assigned unique names, ns and nc- The atoms set{ns) and set{nc) are added 
to r{S) and r(C) respectively. Each element of S and C is encoded by a set of rules 
which are added to r{S) and r(C), respectively. Finally, the predicate in(., .) is used 
to specify what belongs to S and C, respectively. Elements of C are represented by 
the predicates order{*,+,+), postcondition{*,+,—), precondition{*,—,+), and 
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maintain{*,+, —,+) where the place holders '+', and '-' denote the name 
of a constraint, a general program, and a formula, respectively. For example, if 
Til -< belongs to C then the set of atoms encoding C will contain the atom 
in{order{no,ni,n2),nc) where no and nc are the names assigned to the ordering 
constraint m -< n2 and C, respectively. Similarly, if C contains (m, </3, 712) then 
in{rnaintain{n(),nip,ni,n2),nc) (again, no and nc are the name assigned to the 
truth constraint ni -< 77.2 and C, respectively) will belong to the set of atoms 
encoding C. 

In the following example, we illustrate the encoding of a general program about the 
blocks world domain. 

Example 4.21. Consider a general program, (5, C), to build a tower from blocks 
a, 6, c that achieves the goal that a is on top of b and b is on top of c, i.e., the goal 
is to make on(a, b) A on(6, c) hold. We have S = {move{b, c), move{a, b)}, and 



C 



o 

fi 
h 

h 
1/4 



move{h,c) < move{a,b), 
{clear{b), move{b, c)), 
{clear{c), move{b, c)), 
{clear{b), move{a, 6)), 
{dear (a), move{a, b)) 



The constant preceding the semicolon is the name assigned to the formula of C. 

The encoding of p = (5, C) is as follows: 

—r{p) = {htnip,ns,nc)} U r{S) U r{Cy, 

— r{S) = {set{ns),in{move{a,b),ns),in{move{b,c),ns)}; and 

— r{C) consists of 

— the facts defining nc and declaring its elements 

{set{nc), in{o, nc),in{fi, nc),in{f2,nc), in{fz, nc), m(/4, nc)} 

— the facts defining each of the constraints in C: 

— the order constraint o: order{o, move{b, c),move{a, b)), 
— the precondition constraints /i, . . . , /j: 

— precondition{fi, clear{b) , move{b, c)), 

— precondition{f2, clear {c),move{b, c)), 

— precondition{f3, clear (b) , move{a, 6)), and 

— precondition{fi, clear{a), move{a, b)). 

□ 

We now present the AnsProlog rules that realize the operational semantics of gen- 
eral programs. For this purpose we need the rules (35)-(45) and the rules for 
checking the satisfiability of formulae that were presented earlier. These rules are 
for general programs whose top level structure is not an HTN. For general programs 
whose top level feature is an HTN we add the following rule: 

trans{N,Ti,T2) ^ htn{N, S,C),not nok{N,Ti,T2). (46) 

Intuitively, the above rule states that the general program N can be unfolded 
between time points Ti and T2 (or alternatively: the trajectory from Ti and T2 is a 
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trace oiN) if iV is an HTN construct {S, C), and it is not the case that the trajectory 
from Ti and T2 is not a trace of N. The last phrase is encoded by nok{N,T\,T2) 
and is true when the trajectory from Ti and T2 violates one of the many constraints 
dictated by {S, C). The main task that now remains is to present AnsProlog rules 
that define nok{N,Ti,T2). To do that, as suggested by the definition of a trace 
of a program {S,C), we will need to enumerate the permutations (ii, . . . ,ik) of 
(1, . . . , A;) and check whether particular permutations satisfy the conditions in C. 
We now introduce some necessary auxiliary predicates and their intuitive meaning. 

— hegin{N,I,T^,Ti,T2) - This means that I, a general program belonging to N, 
starts its execution at time T3, and N starts and ends its execution at Ti and T2 

respectively. 

— end{N,I,T4,Ti,T2) - This means that /, a general program belonging to N, 
ends its execution at time T4, and N starts and ends its execution at Ti and T2, 
respectively. 

— between{T3,Ti,T2) - This is an auxiliary predicate indicating that the inequali- 
ties Ti < T3 < T2 hold. 

— not-Used{N,T,Ti,T2) - This means that there exists no sub-program I oi N 
whose execution covers the time moment T, i.e., T < B 01 T > E where B and 
E are the start and finish time of /, respectively. 

— overlap{N,T,Ti,T2) - This indicates that there exists at least two general pro- 
grams Ii and I2 in N whose intervals contain T, i.e., Bi < T < Ei and 
B2 < T < E2 where Bi and Ei (i = 1,2) is the start- and finish-time of /», 
respectively. 

We will now give the rules that define the above predicates. First, to specify that 

each general program / belonging to the general program {S. C), i.e.. I <E must 
start and end its execution exactly once during the time (5, C) is executed, we use 
the following rules: 

l{begin{N, I, Ts, Ti, T2) : hetween{Ts, Ti, Tj)}! ^ htn{N, S, C), (47) 

in{I,S), 

trans{N, Ti, T2). 

l{end{N,I,T3,Ti,T2) :between{T3,Ti,T2)}l ^ htn{N,S,C), (48) 

in{I, S), 

trans{N,Ti,T2). 

The first (resp. second) rule says that / - a program belonging to S - must start 
(resp. end) its execution exactly once between Ti and T2. Here, we use cardinality 
constraints with variables [Niemela et al. 1999] in expressing these constraints. Such 
constraints with variables are short hand for a set of instantiated rules of the form 
(7). For example, the first rule is shorthand for the set of rules corresponding to 
the following cardinality constraint: 

l{begin{N,I,Ti,Ti,T2), . . . ,begin{N,I,T2,TuT2)}l ^ htn{N,S,C), 

in{I,S), 

trans{N,Ti,T2). 
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We now give the rules defining notjused{., ., ., .) and overlap{., ., ., .). 

used{N,T,Ti,T2) ^ htniN , S , C) , in{I , S) , (49) 
begin{N,I,B,Ti,T2), 
end{N,I,E,Ti,T2), 
B <T <E. 

notjused(N,T,Ti,T2) not used{N ,T,Ti,T2). (50) 
overlap{N,T,Ti,T2) ^ htn{N, S,C),in{h, S), (51) 

begin{N,Ii,Bi,Ti,T2), 
end{N,h,EuTuT2), 
m(/2, S), begin{N, /s, -B2, Ti, Ts), 
end{N,l2,E2,Ti,T2), 
Bi <T <Ei,B2 <T <E2,Ii^ h- 

The rule (49) states that if a general program / in starts its execution at B and 
ends its execution at E then its execution spans over the interval [B, E], i.e., every 
time moment between B and E is used by some general program in N. The rule 
(50) states that if a time moment between Ti and T2 is not used by some general 
program in N then it is noLused. The last rule in this group specifies the situation 
when two general programs belonging to N overlap. 

We are now ready to define nok{., ., .). There are several conditions whose violation 
make 7iok true. The first condition is that the time point when a program starts 
must occur before its finish time. Next, each general program belonging to the set 
S of (5, C) must have a single start and finish time. The violation of these two 
conditions is encoded by the following rules which are added to 11. 



nok{N,Ti,T2) ^ htn{N,S,C),in{I,S),T3 > T4, (52) 

begin{N,I,T3,Ti,T2), 

endiN, I,Ti,TuT2). 
nok{N, Ti, T2) ^ htn{N, S, C),in{1, 5), T3 < T4, (53) 

begin{N,I,T3,TuT2), 

end{N,I,n,Ti,T2), 

not trans{I ,T3,T4) . 
nok{N, Ti, T2) ^ htn{N, S, C), Ti<T < T2, (54) 

not.used{N, T,Ti,T2). 
nok{N, Ti, T2) ^ htn{N, S, C),Ti < T < T2, (55) 

overlap{N,T,Ti,T2). 

Together the rules (47)-(55) define when the permutation determined by the set of 
atoms of the form begin{N, I, B, Ti, T2) and end{N, I, E, Ti, T2) violates the initial 
part of condition 8 of Definition 4.20. The rules (47)-(48) require each general 
program in N to have a unique start and finish time and the rule (52) encodes the 
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violation when the finish time is earlier than the start time. The rule (53) encodes 
the violation when the trace of a general program in N does not correspond to its 
start and finish times. The rule (54) encodes the violation when some time point 
on the trajectory of N is not covered by the trace of a general program in N; and 
the rule (55) encodes the violation when the trace of two general programs in N 
overlap. 

The next group of rules encode the violation of conditions 8(b) - 8(d) of Definition 
4.20. 



nok{N,Ti,T2) ^ htn{N, S,C),iniIi, S),begin{N, h, Bi,Ti,T2), (56) 

m(/2, S), begin{N, h, B2,Ti,T2), 

in{0, C), orderiO, h , h),Bi > B2. 
nok{N,Ti,T2) ^ htn{N, S,C),in{Ii, S),end{NJi, Ei,Ti,T2), (57) 

in{l2,S),begin{N,l2,B2,Ti,T2),Ei < Tg < ^2, 

in{0, C) , maintain{0 , F, /i, 12), not hf{F, T3). 
nok{N,Ti,T2) ^ htn{N,S,C),iniI,S),begin{N,I,B,Ti,T2), (58) 

in{0, C),precondition{0, F, I), not hf{F, B). 
nok{N,Ti,T2) ^ htn{N, S,C),in{I, S),end{N, I, E,Ti,T2), (59) 

in{0, C),postcondition{0, F, I), not hf{F, E). 

The rule (56) encodes the violation when the constraint C of the general program 
N = {S, C) contains Ji -< h, but I2 starts earlier than Ji . The rule (57) encodes the 
violation when C contains {Ii,F, I2) but the formula F does not hold in some point 
between the end of Ii and start of I2 ■ The rules (58) and (59) encode the violation 
when C contains the constraint (F, /) or (/, F) and F does not hold immediately 
before or after respectively, the execution of I. 

We now formulate the correctness of our above encoding of procedural and HTN 

knowledge given as general programs, and relate the traces of a general program 
with the answer sets of its AnsProlog encoding. For an action theory {D, T) and 
a general program p, let H^^^ be the AnsProlog program obtained from n„ by 
(i) adding the rules (35)-(45) and (46)-(59), (ii) adding r{p), and (iii) replacing 
the goal constraint with _L <— not trans{np,0,n). The following theorem extends 
Theorem 4.18. 

Theorem 4.22. Let {D,r) be a consistent action theory and p be a general 
program. Then, 

(i) for every answer set M of H^"^^ with occ{ai,i) S M for i e {0, . . . , n — 1}, 

so{M)aQ . . . a„_is„(M) is a trace of p; and 

(ii) if Soflo • • • cin-iSn is a trace of p then there exists an answer set M of ^n^^ 
such that Sj = Sj{M) and occ{ai,i) G M for j e {0, . . . , n} and i £ {0,. . . , n—1} 
and trans{np,0,n) e M. 



Proof. See Appendix A.4 
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As before, to do planning using procedural and HTN constraints all we need to do 
is to add the goal constraint to U^'^'^ , which will filter out all answer sets where 
the goal is not satisfied in time point n, and at the same time will use the sketch 
provided by the general program p. 

4.4 Demonstration Experinnents 

Wc tested our implementation with some domains from the general planning liter- 
ature and from the AIPS planning competition [Bacchus et al. 2000]. In particular, 
we tested our program with the Miconic-10 elevator domain. We also tested our 
program with the Block domain. In both domains, wc conducted tests with proce- 
dural control knowledge. Our motivation was: (i) it has already been established 
that well-chosen temporal and hierarchical constraints will improve a planner's ef- 
ficiency; (ii) wc have previously experimented with the use of temporal knowledge 
in the answer set planning framework [Tuan and Baral 2001]; and (iii) we are not 
aware of any empirical results indicating the utility of procedural knowledge in 
planning, especially in answer set planning. (Note that [Rcitcr 2000] concentrates 
on using GOLOG to do planning in domains with incomplete information, not on 
exploiting procedural knowledge in planning.) 

We report the results obtained from our experiment with the elevator exam- 
ple from [Levesque et al. 1997] (elpl-elp3) and the Miconic-10 elevator domain 
(sl-0,. . . ,s5-0), proposed by Schindler Lifts Ltd. for the AIPS 2000 competition 
[Bacchus et al. 2000]. Note that some of the planners, that competed in AIPS 
2000, were unable to solve this problem. The domain description for this example 
is described earlier in Example 4.15 and the smodels code can be downloaded from 
http : //www. cs .nmsu . edu/~tson/ASPlan/Knowledge. We use a direct encoding of 
procedural knowledge in the Block domain to avoid the grounding problem. For 
this reason, we do not include the results of our experiments on the Block domain in 
this paper. The results and the encodings of this domain are available on the above 
mentioned web site. We note that the direct encoding of procedural knowledge 
in the Block domain yields significantly better results, both in terms of the time 
needed to find a trajectory as well as the number of problems that can be solved. 
Whether or not the methodology used in the Block domain can be generalized and 
applied to other domains is an interesting question that we would like to investigate 
in the near future. 

The initial state for the elevator planning problem encodes a set of floors where 

the light is on and the current position of the elevator. For instance, F — 
{on(l), on(3), on(7), cur rent Floor {4)}. The goal formula is represented by the con- 
junction jg ^ gQQj.-ion(/). Sometimes, the flnal position of the elevator is added 
to the goal. The planning problem is to find a sequence of actions that will serve all 
the floors where the light is on and thus make the on predicate false for all floors, 
and if required take the elevator to its destination floor. 

Since there are a lot of plans that can achieve the desired goal, we can use procedural 
constraints to guide us to preferable plans. In particular, we can use the procedural 
knowledge encoded by the following set of simple GOLOG programs from [Levesque 
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et al. 1997], which wc carhcr discussed in Example 4.15. 

{go-floor{N) : currentFloor{N)\up{N)\down{N)) 

{serve{N) : go-floor{N);turnoff{N); open; close) 

{serve-tt- floor : pic\s.{N,d{N),{on{N);serve{N)))) 

{park : if currentFloor{Q) then open else [down{0);open]) 

{control : [while 3A''.{0, 1, . . . , fc} [on(A'')] do serve_a-floor{N)];park) 

We ran experiments on a Sony VAIO laptop with 256 MB Ram and an Intel Pentium 
4 1.59 GHz processor, using Iparse version 1.00.13 (Windows, build Aug 12, 2003) 
and smodels version 2.27. for planning in this example with and without the 
procedural control knowledge. The timings obtained are given in the following 
table. 



Problem 


Plan 


# Person 


# Floors 


With Control 


Without Control 




Length 






Knowledge 


Knowledge 


elpl 


8 


2 


6 


0.380 


0.140 


elp2 


12 


3 


6 


0.901 


0.230 


elp3 


16 


4 


6 


2.183 


1.381 


elp4 


20 


5 


6 


4.566 


79.995 


sl-0 


4 


1 


2 


0.120 


0.020 


s2-0 


7 


2 


4 


1.201 


0.080 


s3-0 


9 


3 


6 


5.537 


0.310 


s4-0 


15 


4 


8 


64.271 


15.211 


s5-0 


19 


5 


10 


260.183 


1181.158 



As can be seen, the encoding with control knowledge yields substantially better 
performance in situations where the plan length is longer. In some instances with 
small plan lengths, as indicated through boldface in column 6, the speed up due 
to the use of procedural knowledge does not make up for the overhead needed in 
grounding the knowledge. The output of smodels for each run is given in the 
file result at the above mentioned URL. For larger instances of the elevator domain 
[Bacchus et al. 2000] (5 persons or more and 10 floors or more), our implementation 
terminated prematurely with either a stack overflow error or a segmentation fault 
error. 

5. CONCLUSION 

In this paper we considered three different kinds of domain-dependent control 
knowledge (temporal, procedural and HTN-based) that are useful in planning. Our 
approach is declarative and relies on the language of logic programming with an- 
swer set semantics. We showed that the addition of these three kinds of control 
knowledge only involves adding a few more rules to a planner written in AnsProlog 
that can plan without any control knowledge. We formally proved the correctness 
of our planner, both in the absence and presence of the control knowledge. Finally, 
we did some initial experimentation that illustrates the reduction in planning time 
when procedural domain knowledge is used and the plan length is big. 
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In the past, temporal domain knowledge has been used in planning in [Bacchus 
and Kabanza 2000; Doherty and Kvarnstom 1999]. In both cases, the planners are 
written in a procedural language, and there is no correctness proof of the plan- 
ners. On the other hand the performance of these planners is much better^** than 
our implementation using AnsProlog. In comparison, our focus in this paper is 
on the 'knowledge representation' aspects of planning with domain dependent con- 
trol knowledge and demonstration of relative performance gains when such control 
knowledge is used. Thus, we present correctness proofs of our planners and stress 
the case of adding the control knowledge to the planner. In this regard, an interest- 
ing observation is that it is straightforward to add control knowledge from multiple 
sources or angles. Thus say two different general programs can be added to the 
planner, and any resulting plan must then satisfy the two sketches dictated by the 
two general programs. 

As mentioned earlier our use of HTN-based constraints in planning is very different 
from HTN-planning and the recent HTN-based planner [Nau et al. 1999]. Unlike 
our approach in this paper, these planners cannot be separated into two parts: one 
doing planning that can plan even in the absence of the knowledge encoded as HTN 
and the other encoding the knowledge as an HTN. In other words, these planners 
arc not extended classical planners that allow the use of domain knowledge in the 
form of HTN on top of a classical planner. The timings of the planner [Nau et al. 
1999] on AIPS 2000 planning domains are very good though. To convince ourselves 
of the usefulness of procedural constraints we used their methodology with respect 
to procedural domain knowledge and wrote general programs for planning with 
blocks world and the package delivery domain and as in [Nau et al. 1999] we wrote 
planners in a procedural language (the language C to be specific) for these domains 
and also observed similar performance. We plan to report this result in a future 
work. With our focus on the knowledge representation aspects we do not further 
discuss these experiments here. 

Although we explored the use of each of the different kinds of domain knowledge 
separately, the declarative nature of our approach allows us to use the different kinds 
of domain knowledge for the same planning problem. For example, for a particular 
planning problem we may have both temporal domain knowledge and a mixture 
of procedural and hierarchical domain knowledge given as a general program. In 
such a case, planning will involve finding an action sequence that follows the sketch 
dictated by the general program and at the same time obeys the temporal domain 
knowledge. This distinguishes our work from other related work [Huang et al. 1999; 
Kautz and Selman 1998b; Baral and Son 1999; Mcllraith 2000] where the domain 
knowledge allowed was much more restricted. 

A byproduct of the way we deal with procedural knowledge is that, in a preposi- 
tional environment, our approach to planning with procedural knowledge can be 
viewed as an off-line interpreter for a GOLOG program. Because of the declarative 



-•^^This provides a challenge to the community developing AnsProlog* systems to develop AnsPro- 
log* systems that can match or come close to (if not surpass) the performance of planners with 
procedural knowledge. 
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nature of AnsProlog the correctness of this interpreter is easier to prove than the 
earher interpreters which were mostly written in Prolog. 

ELECTRONIC APPENDIX 

The electronic appendix for this article can be accessed in the ACM Digital Li- 
brary by visiting the following URL: http://www.acin.org/pubs/citations/ 
journals/tocl/2008-V-N/pl-URLend. 
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Appendix A - Proofs 

We apply the Splitting Theorem and Splitting Sequence Theorem [Lifschitz and 
Turner 1994] several times in our proof^^. For ease of reading, the basic notation and 
the splitting theorem are included in Appendix B. Since we assume a propositional 
language any rule in this paper can be considered as a collection of its ground 
instances. Therefore, throughout the proof, we often say a rule r whenever we refer 
to a ground rule r. By Ut{'K), we denote the set of literals of a program tt. 

Appendix A.l - Proof of Theorem 3.2 

For a planning problem (D, F, A), let 

TT = n„(D, r, A) \ {(20), _L ^ not goal}, 

i.e., TT is obtained from n„(£),r,A) by removing the rules encoding A and the 
constraint _L <— not goal. Let 5 be a set of literals of the form holds {I, t). Abusing 
the notation, wc say that S is consistent with respect to F (or consistent, for short) 
if for every pair of a fluent / G F and a time moment t, S does not contain both 
holds{f,t) and holds{-^f,t). For a set of causal laws K and a set of fluent literals 
y, let 

Mk{Y) = Y\J{1 I 3caused({pi,...,pfe},0 s.t. F ^ pi A . . . Apfe}. (60) 



-'^■''To be more precise, wc use a modified T.'(Tsioii of tfiese tfieorems since programs in this paper 
contain constraints of the form (6) which were not discussed in [Lifschitz and Turner 1994]. The 
modification is discussed in Appendix B. 
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It is easy to see that Mk{Y) is a monotonic function and bounded above by F. Thus, 



the sequence {M}^{Y))i<^, where M^{Y) = Y and M'+^{Y) = MK{M}^iY)) for 



i > 0, is a convergent sequence. In other words, there exists some z > such 
that M^j^^{Y) = M^j^{Y) for j > i. Let C{K,Y) = Ui<oo^lf(^)- We have that 
C{K, Y) is closed under K. We can prove the following lemma. 

Lemma .1. For a set of causal laws K and a set of fluent literals Y, 

(i) C{K,Y) is consistent and C{K,Y) = CIk{Y) iff CIk{Y) is defined; and 
{2) C{K,Y) is inconsistent iff CIk{Y) is undefined. 

Proof. The lemma is trivial for inconsistent set of literals Y. We need to prove 
it for the case where Y is consistent. 

(1) Consider the case where C{K, Y) is consistent. From the definition of the 
function Mk in (60), wc have that Mk{S) C CIk{S) for every set of literals S. 
Therefore, M\^{Y) C CIk{Y) for every i. Thus, C{K,Y) C CIk{Y). Because 
C{K, Y) is closed under K and CIk{Y) is the least set of literals closed under 
K, we can conclude that CIk{Y) C C{K,Y). 

To complete the proof of the first item, we need to show that if CIk{Y) is 
defined, then C{K,Y) is consistent and C{K,Y) = CIk{Y). Again, it follows 
immediately from Equation (60) that Mk{S) C CIk{S) for every consistent 
set 5*. This implies that M^(y) C CIk{Y) for every i. Thus, we have that 
C{KX) C CIk{Y). This implies the consistency of C{K,Y). The equality 
C{K,Y) = CIk{Y) follows from the closeness of C{K,Y) with respect to K 
and the definition of CIk{Y). This concludes the first item of the lemma. 

(2) Consider the case where C{K,Y) is inconsistent. Assume that CIk{Y) is 
defined. By definition of M|^-(y), we know that if M^(y) is inconsistent 
then M]^^{Y) is also inconsistent. Thus, there exists an integer k such that 
M^(Y) is consistent and M^^^(y) is inconsistent. Because of the consistency 
of ClxiY) and M^{Y) C CIk{Y), we conclude that M^+\Y) \ ClxiY) + 0. 
Consider I e M^+^(F) \ CIk^Y). By the definition of Mk, there exists some 
static causal law 



in K such that {pi, . . . C M^(y). This imphes that Ciif(F) is not closed 
under K, which contradicts the definition of CZx (5^)- This shows that CIk^) 
is undefined. 

Now, consider the case where CIk(Y) is undefined. Using contradiction and 
the result of the first item, we can also show that C{K, Y) is inconsistent. This 
concludes the proof of the second item of the lemma. 



Lemma .2. For a set of causal laws K and a set of fluent literals Y , for every 

integer k, the program consisting of the following rules: 



caused({pi, . . . ,_p„},0 



□ 



holds{l, k) ^ 
holds{l, k) <— 



holds{li, fc), . . . , holds{lm, k) 



{ifca.used{{li,...,lrn},l) & K) 
{iflGY) 



_L ^ fluent{f),holds{f,k),holds{-nf,k) (iffeF) 
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(1) has a unique answer set {holds{l,k) \ I £ CIk{Y)} iff CIk{Y) is defined; and 

(2) does not have an answer set iff CIk{Y) is undefined. 

Proof. Let us denote the given program by Q and P be the program consisting 
of the rules of Q with the head different than ±. Since P is a positive program, 
we know that it has a unique answer set, says X. It is easy to see that if X is 
consistent with respect to F, then X is the unique answer set of Q; otherwise, Q 
does not have an answer set. 

It is easy to see that X = {holds{l,k) | / e C{K,Y)} is the unique answer set of 
P. Consider the two cases: 

(1) CIk{Y) is defined. Lemma .1 impUcs that C{K,Y) = CIk{Y), and hence, X 
is consistent with respect to F. This imphes that X is the unique answer set of 
Q. 

(2) CIk{Y) is undefined. Lemma .1 impHcs that C{K,Y) is inconsistent, which 
implies that X is inconsistent with respect to F, and hence, Q does not have 
an answer set. 

□ 

We now prove some useful properties of tt. We will prove that if {D, F) is consistent 
then TT is consistent (i.e., tt has an answer set) and that tt correctly implements 
the transition function $ of D. First, we simplify tt by using the splitting theo- 
rem [Lifschitz and Turner 1994] (Theorem .11, Appendix B). Let V be the set of 
literals in the language of tt whose parameter list does not contain the time param- 
eter, i.e., V consists of auxiliary atoms of the form literal{l), fluent{f), action{a), 
contrary {li,^)- 

It is easy to see that F is a splitting set of tt. Furthermore, it is easy to see that 
the bottom program 6\/(7r) consists of the rules that define actions, fluents, and 
the rules (14)-(17). Obviously, 6y(7r) is a positive program, and hence, it has a 
unique answer set. Let us denote the unique answer set of &y(7r) by Aq. The partial 
evaluation of tt with respect to (V, Aq), tti = ey(7r \ 6y(7r), ^o), is the collection of 
the following rules: 

holds{l,t+l) <— occ{a,t),holds{li,t), . . . ,holds{lk,t). (61) 
(if causes(a, I, {li, . . . , Ik}) £ D) 
holds{l,t) <— holds{li,t), . . . ,holds{lm,t). (62) 
(if caused({Zi, . . . , Im}, I) e D) 
possible{a,t) <— holds{li,t), . . . ,holds{lt,t). (63) 
(if executable(a, {Zi, . . . , It}) G D) 
holds{l,0) ^ (64) 
(if initially(0 € F) 
occ{a,t) possible{a,t),notnocc{a,t). (65) 
(if a is an action) 
nocc{a,t) <— occ{b,t). (66) 
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(for every pair of actions a ^ b) 
holds{f,t+l) ^ holds{f,t),notholds{^f,t+l). (67) 
(for every fluent /) 
holds{^f,t+l) ^ holds{-nf,t),notholds{f,t+l). (68) 
(for every fluent /) 
_L ^ holds{f,t),holds{^f,t). (69) 
(for every fluent /) 

It follows from the splitting theorem that to prove the consistency and correctness 
of TT it is enough to prove the consistency of tti and that tti correctly implements 
the transition function $ of D. We prove this in the next lemmas. 

Lemma .3. Let (-D, F) be a consistent action theory. Let X be an answer set of 
TTi . Then, 

(1) St{X) is a state of D for every t € {0, . . . , n} for every t G {0, . . . ,n — 1}, 

(2) if X contains occ{a,t) thena is executable in st{X) and st+i(X) e ^{a, st{X)), 
and 

(5) for every t G {0, . . . ,n — 1}, if occ{a, t) ^ X for every action a, then St+i{X) = 

st{X). 

Proof. It is easy to see that the sequence {UtYf^Q, where 

Ut = {holds{l, A;) I / is a literal and k < t}U 
{occ{a, fc) I a is an action and k < t}\J 

{nocc{a, fc) I a is an action and k < t}U 
{possible{a, k) \ a is an action and k < t} U {-L}, 

is a splitting sequence of tti. Since X is an answer set of tti, by the splitting 
sequence theorem (Theorem .12, Appendix B), there exists a sequence of sets of 
literals {Xt)^^^ such that Xt CUt \ Ut-i, and 

^ = Ur=o 

— Xq is an answer set of bij^ (tti ) and 

— for every t > 0, Xi is an answer set of Cf/^ (fef/^ (tti) \ &;7f-i (^i)j Ui<t-i -^i)- 

We will prove the lemma by inductively proving that for every t, < t < n, the 
following holds: 

(i) Xt is complete and consistent with respect to F in the sense that for each fluent 
/, Xt contains either holds{f,t) or holds{-^f,t) but not both, 

(ii) Xt contains at most one atom of the form occ{a,t)., 

(iii) st{Xt) is a state of D, and 

(iv) if occ{a,t — 1) G Xt-i then a is executable in st-i{Xt-i) and St{Xt) £ 
$(a, Si_i(Xf_i)); if no atom of the form occ{a,t — 1) belongs to Xt-i then 
St-i{Xt_^) = St{Xt). 



i^Recall that for every set Y, st{Y) is the set {/ | holds{f,t) € Y} 
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Base case: t = 0. Trivially, Xq satisfies (iv). So, we only need to show that 
Xq satisfies (i)-(iii). Let Pq = buoi^^i)- We have that Pq consists of only rules of 

the form (62)-(66) and (69) with t = 0. Let Zq = {holds{f, 0) [ / is a fluent} U 
{holds{^f,0) I / is a fluent}. We can easily checked that Zq is a splitting set of 
Pq. Thus, by the splitting theorem, Xq = Mq U A^o where Mq is an answer set of 
bzo{Po) and A^o is an answer set of ezo,M„ = ez„{PQ \ bz„{Pa), Mq). Because Mq 
contains only literals of the form holds{l,0) and A^o contains only literals of the 
form occ{a,0), nocc{a,0), and possible{a,0), we have that so(-^o) = so(Afo) and 
occ{a,0) e Xq iff occ{a,0) G A^o- Hence, to prove that Xq satisfies (i)-(iii), we show 
that Mfj satisfies (i) and (iii) and A"o satisfies (ii). 

We have that the bottom program bzo{Po) consists of rules of the form (64) and 
(62). Because of the consistency of {D,r), we have that so = {/ I initially(/) e F} 
is consistent and hence {holds{f, 0) | initially(/) G F} is consistent with respect to 
F. It follows from Lemma .2 that Mq = {holds{f,0) \ f G Sq} is the unique answer 
set ofbzoiPo) where sq is the initial state of (£>, F) . Because of the completeness of F 
and the consistency of (D, F), we can conclude that Mq is complete and consistent. 
Thus, Mq satisfies (i). Furthermore, because sq{Mq) = sq, we conclude that Mq 
satisfies (iii). 

The partial evaluation of Pq with respect to {Zo,Mo), ezo,Mo, consists of 

' possible{a, 0) <— (al) 
(if executable(a, {^i, . . . , Im}) S D 
and holds{li,0) G Mq) 
occ{a,0) possible{a,0),not nocc{a,0). (a2) 

nocc{a,0) <— occ{b,0). (a3) 
(for every pair of actions a ^ b) 
^ holds{f,0),holds{^f,0) (a4) 
(for every fluent /) 

Let R be the set of atoms occurring in the rule (al) of ezo.Mo- There are two cases: 

— Case 1: i? = 0. Obviously, the empty set is the unique answer set of ezo,Mo- 

Thus, A'o does not contain any atom of the form occ(a, 0). 
— Case 2: i? / 0. By applying the splitting theorem one more time with the 

splitting set R we can conclude that A^o is an answer set of ezo,Mo if and only if 

there exists some action a, possible{a, 0) G R, and 

Nq = RU {occ{a, 0)} U {nocc{b, 0) | 6 is an action m D,b^ a}. 

Thus, A^o contains only one atom of the form occ(a, 0). 

The above two cases show that Nq contains at most one atom of the form occ{a, 0). 
This concludes the proof of the base case. 

Inductive step: Assume that Xf, t < k, satisfies (i)-(iv). We will show that X^ 
also satisfies (i)-(iv). Let Mfe_i = {J^Iq Xt. The sphtting sequence theorem implies 
that X/j is an answer set of Pk that consists of the following rules: 

holds{l,k) ^ (70) 
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(if occ{a, /c — 1) G Mfe_i, 






causes(a, I, {h, . . . , Im}) G D, 






holds{li,k- 1) e Mfe_i) 




holds{l, k) 


<— holds{li,k), . . . ,holds{lm,k). 


(71) 




(if caused({/i, . . . , Im}, I) G D) 




possible{a, k) 


^ holds{li, k), . . . , holds{lt, k). 


(72) 




(if executable(a, {li, . . . , k}) S D) 




occ{a, k) 


^ possible{a,k),not nocc{a,k). 


(73) 




(if a is an action) 




nocc{a, k) 


<— occ{b,k). 


(74) 




(for every pair of actions a ^ b) 




holds{f, k) 


^ not holds{-'f, k). 


(75) 




(if holdsif,k-l) e Mk-i) 




holds{^f, k) 


<— not holds{f, k). 


(76) 




(if holds{^f,k-l) e Mk-i) 




_L 


<— holds{f,k),holds{-^f,k). 


(77) 



From the constraint (77), we have that for every fluent /, Xf. cannot contain both 
holds{f, k) and holds{^f,t). This means that X). is consistent. We now show that 
Xk is also complete. Assume the contrary, i.e., there exists a fluent / such that 
neither holds{f, k) nor holds{-^f, /) belongs to X^. Because of the completeness of 
Sfc_i(Xfe_i) (Item (i), inductive hypothesis), either holds{f,k — 1) G Sk-i{Xk-i) 
or holds{f,k — 1) ^ Sk-i{Xk-i). If the first case happens, rule (75) belongs to 
Pfe, and hence, X^ must contain holds{f,k), which contradicts our assumption 
that holds{f, k) ^ Xk- Similarly, if the second case happens, because of rule (76), 
we can conclude that holds{-if , k) G Xk which is also a contradiction. Thus, our 
assumption on the incompleteness of Xk is incorrect. In other words, we have 
proved that Xk is indeed complete and consistent, i.e., (i) is proved for Xk- We 
now prove the other items of the conclusion. Let 



Yfc = {holds{l,k) I Hs a fluent literal and holds{l,k) G Xk} 



and 

Zk = {holds{l, fc) I Hs a fluent literal}. 

Zk is a splitting set of Pk- Let TTk = bz^ (Pk)- Prom the splitting theorem, we know 
that Yfe must be an answer set of the program {wk)^'' that consists of the following 
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rules: 

holds{l, k) 

holds{l, k) 
holds{f, k) 
holds{-'f, k) 

_L 



(if occ{a, k - 1) e Mfe_i, causes(a, l,{li, . . . , e D, 
holds{li, k — 1) e Mk-i for i = 1, . . . , m) 
holds{li, fc), . . . , holds{ln, k). 
(if caused({Zi, . . . , Z„}, I) G D) 

( if holds{f, k- 1) e Mk-i and holds{^f, k) ^ Yfe) 

( if holdsi^f, k-l)e Mk-i and holds{f, k) ^ Yk) 
holds{f, k), holds{^f, k). 
(for every fluent /) 



(bl) 

(b2) 
(b3) 
(b4) 
(b5) 



Let Qi and Q2 be the set of atoms occurring in the rule (bl) and (b3)-(b4), respec- 
tively. Let Ci = {/ I holds{l,k) € Qi} and C2 = {I \ holds{l,k) € Q2}- There are 
two cases: 

— Ccise 1: Mk-i does not contain an atom of the form occ{a,k — 1). We have 
that Qi = and C2 C Sk-i{Xk-i). From Lemma .2, we know that (TTfe)^* has a 
unique answer set {holds{f,k) \ f £ C/£)p(C2)} which is Yfe. Because Sfe(Xfe) = 
{/ I / e Yk} and the definition of CIdc^ we have that Sk{Xk) C Sk-i{Xk-i). The 
completeness and consistency of Sk{Xk) and Sk^i{Xk-i) implies that Sfe(Xfc) = 
Sfc_i(Xfe_i). Because Xfe_i satisfies (i)-(iv), Xk also satisfies (i)-(iv). 

— Case 2: There exists an action a such that occ{a,k — 1) G Mfe_i. Because of 
the rule (bl) we have that Ci = E{a, Sk-i{Xk-i))- The completeness of Sk{Xk) 
and Sfe_i(Xfe_i) and the rules (b3)-(b4) imply that C2 = Sk{Xk) n Sfe_i(Xfe_i). 
Furthermore, Lemma .2 implies that (nk)^'' has a unique answer set {holds{f, k) \ 
/ G CId^,{Ci U C2)} which is Y/,, (because Yfe is an answer set of TTfe). Hence, 
Sk{Xk) = Cii3c(£;(a, Sfe„i(Xfe_i)) U {sk{Xk) r\ Sk-i{Xk-i))). This implies that 
Sk{Xk) G $(a, Sfe_i(Xfe_i)). In other words, we have proved that Xk satisfies 
(iii)-(iv). 

The above two cases show that X^ satisfies (iii) and (iv). It remains to be shown 
that Xk contains at most one atom of the form occ{a, k). Again, by the splitting 
theorem, we can conclude that Nk = Xk\ Yk must be an answer set of the following 
program 



en = < 



' possible{a, k) 

occ{a, k) 

nocc{a, k) 



(if executable(a, {li, . . . , Im}) G D 
and holds{li,k) G Yfe) 
possible{a, k), not nocc{a, k). 
(if a is an action) 
occ(b, k). 

(for every pair of actions a^h) 



Let Rk be the set of atoms occurring in the first rule of ey^ . Similar to the proof 
of the base case, we can show that for every answer set A^fe of ey^ , either Nk does 
not contain an atom of the form occ{a, k) or there exists one and only one action 
a such that possible{a, k) G Rk and Nk = iife U {occ{a, k)} U {nocc{b, a) | 6 is an 
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action, b ^ a}. In either case, we have that =Y}.{J satisfies the conditions 
(ii). The inductive step is proved. 

The conclusion of the lemma follows immediately from the fact that St{X) = Sf (Xj) 
for every t and occ{a, t) € X iS occ{a, t) £ X^ and X^ satisfies the property (i)-(iv). 
The lemma is proved. □ 

Lemma .4. For every trajectory soag . . . a„_is„ in D and a consistent action 
theory {D,T), m has an answer set X such that 

(1) St{X) = st for every t, < t < n, and 

(2) occ{at, t) G X for every t, < t < n — 1. 

Proof. We prove the theorem by constructing an answer set X of tti that sat- 
isfies the Items 1 and 2. Again, we apply the splitting sequence theorem with the 
splitting sequence (C/t)"=0) where 

Ut = {holds{l, fc) I Hs a literal and k < t}\J 
{occ{a, fc) I a is an action and k < t}U 
{nocc{a, fc) I a is an action and k < t}L\ 
{possible{a, fc) | a is an action and fc < t} U {-L}. 

For every t, < t < n, let Rt = {possible{a, t) \ a is executable in s^}. We define a 
sequence of sets of literals (Xt)"^o ^ follows. 

—For < t < n - 1, 

Xt = {holds{f,t) I / e St} U {occ{at,t)}U 

{nooc{b, t) \ b is an action in D, 6 ^ at} U Rt- 

— If Rn^%, then let a„ be an arbitrary action that is executable in s„ and 

Xn = {holds{f,n) I / e s„} U {occ(a„,n)}U 

{nooc{b, t) \ b is an action in D, b ^ a„} U 

—If Rn = 0, then 

= {holds{f,n) I / G Sn} U {nooc{b,t) | 6 is an action in D} U J?„, 

We will prove that {Xt)"^Q is a solution to m with respect to {Ut)t^Q- This amounts 
to prove that 

— Xo is an answer set of buo{7^i) and 

— for every t > 0, X^ is an answer set of {bu^ (tti) \ but_-, (tti), Ui<t-i ^i)- 

We first prove that Xq is an answer set of Pq = buo{T^i)- By the construction of Pq 
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and Xq, wc have that (Pq) consists of the following rules: 



(Po) 



Xo 



holds{f,0) 


^ (if initially(/) € T) 


(al) 


holds{l,0) 


<— holds{li,0), . . . ,holds{lm,0)- 


(a2) 




(if caused({Zi, . . . , I) e D) 




possible{a, 0) 


<— holds{li,0), . . . ,holds{lm,0)- 


(a3) 




(if executable(a, {Zi, . . . , e £)) 




occ(ao, 0) 


<— possible{a(), 0). 


(a4) 


nocc{b, 0) 


<— occ(a,0). 


(«5) 




(for every pair of actions b ^ a) 




_L 


^ holds{f,0),holdsi^f,0) 


(a6) 




(for every fluent /) 





We will show that Xq is a minimal set of literals closed under the rules (al)-(a6) 
and therefore is an answer set of Pq- Since holds{f,0) G Xq iS f G sq (Definition 
of Xq) and / G sq iff initially(/) £ F (Definition of ,so), we conclude that Xq is 
closed under the rule of the form (al). Because of so is closed under the static 
causal laws in D, we conclude that Xq is closed under the rule of the form (a2). 
The definition of Ro guarantees that Xq is closed under the rule of the form (a3). 
Since sgao . . . a„_is„ is a trajectory of D, ao is executable in 5*0. This implies that 
possible{ao, 0) G Rq. This, together with the fact that occ{aQ, 0) € Xq, implies that 
Xf) is closed under the rule (a4). The construction of Xq also implies that X^ is 
closed under the rule (a5). Finally, because of the consistency of F, we have that 
Xq does not contain holds{f,0) and holds{neg{f),0) for any fluent /. Thus, Xq is 
closed under the rules of (Po)^°- 

To complete the proof, wc need to show that Xo is minimal. Consider an arbitrary 
set of atoms X' that is closed under the rules (al)-(a6). This implies the following: 

— holds{f,0) e X' for every / G sq (because of the rule (al)). 
— Ro C X' (because of the rule (a3) and the deflnition of Rq). 
— occ{ao,0) S X' (because of the rule (a4)). 

— {nocc{b,0) I 6 is an action, b ^ a} C X' (because occ(ao,0) € X' and the rule 
(a5)). 

The above items imply that Xq C X'. In other words, we show that Xo is a minimal 
set of literals that is closed under the rules (al)-(a6). This implies that Xq is an 
answer set of (Pq)^", which implies that Xq is an answer set of Pq- 

To complete the proof of the lemma, we will prove by induction over t, t > 0, that 
Xf is an answer set of = eu^{bu^_^ {tti) \ 6(7t_i(7ri),U-<j_-^ Xj). Since the proof 
of the base case {t = I) and the inductive step is similar, we skip the base case and 
present only the proof for the inductive step. Now, assuming that Xt, t < k, is an 
answer set of Pf. We show that Xk is an answer set of P^. Let M^-i = Ui<fe-i -^i- 
The program Pk consists of the following rules: 

holds{l,k) ^ (78) 

(if causes(afe_i, /, {h, . . . , /„}) e D, holds{li, k — 1) G Mfe_i) 
holds{l,k) <— holds{li,k), . . . ,holds{lm,k). (79) 
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(if caused({Zi, . . . , I) e D) 
possible{a,k) <— holds{li, k), . . . , holds{lt, k). (80) 
(if executable(a, {h, . . . , It}) G D) 
occ{a,k) <— possible{a,k),not nocc{a,k). (81) 
(if a is an action) 

nocc{a,k) ^ occ{b,k). (82) 

(for every pair of actions a ^ b) 
holds{f,k) <— not holds{^f,k). (83) 
(if holds{f, fc - 1) e Mfe_i) 
holds{-^f,k) •!— not holds{f,k). (84) 
(if holdsi^f,k-l) G Mfe_i) 
_L holds{f,k),holds{^f,k). (85) 



It is easy to sec that Pk can be split by the set of Uteral Zk — {holds( f, k) | / is a 
fluent Uteral} and the bottom program tt^ = bzf.{Pk) consists of the rules (78)-(79) 
and (83)-(84). We will prove first that Yfe = {holds{l,k) \ holds{l,k) G X^} is an 
answer set of the program (iTk)^'' that consists of the following rules: 



holds(l,k) ^ (bl) 

(if causes(afe_i, Z, {h,..., Im}) € D, 

holds{li, k — 1) G Mfe_i for i = 1, . . . , m) 
holds{l,k) ^ holds{li,k), . . . ,holds{ln,k). (b2) 

(if caused({Zi, . . . , In}, I) G D) 
holds{f,k) ^ (bS) 

(if holds{f, A: - 1) G Mu-i and holds{^f, k) ^ Yk) 
holds{-nf,k) ^ (b4) 

(if holdsi^f, fc - 1) G Mk-i and holds{f, k) ^ Yk) 
_L ^ holds{f,k),holds{^f,k). (b5) 



Let Qi and Q2 be the set of atoms occurring in the rule (bl) and (b3)-(b4), re- 
spectively. Let Ci = {I \ holds{l,k) G Qi} and C2 = {I \ holds{l,k) G Q2}- 
Rule (bl) and the fact that / G Sk-i{Xk-i) iff holds{f,k — 1) G M^-i imply 
that Ci — E{ak-i,Sk-i{Xk-i)). Similar argument allows us to conclude that 
C2 = Sk{Xk) n Sk-i{Xk-i). Lemma .2 implies that (ttj;)^'' has a unique answer 
set Y = {holds{f,k) \ f e ClodCi U C2)}. Since occ{ak-i,k - 1) G Mk-i and 
Sk{Xk) G <E>(afc_i,Sfc_i(Xfc_i)), wc have that .Sfc(Xfe) = CIdc{Ci U C2). It follows 
from the definition of Y^ that Yk = Y. Thus, Y^ is an answer set of tt/j. It follows 
from the splitting theorem that to complete the proof of the inductive step, we need 
to show that Nk = Xk \ Yk is an answer set of the partial evaluation of Pk with 
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respect to {Zk, Ife), eZk.Yk = ez^ (Pk \ bz^ (Pk),Xk), which is the following program 
possible{a, k) ^ (if executable(a, . . . , Im]) G D 



and holds{li, k) e Yfe) 
occ{a,k) possible{a,k),not nocc{a,k). 
(if a is an action) 
nocc{a,k) <— occ{b,k). 

(for every pair of actions a^b) 



It is easy to sec that the reduct of eZk,Yk with respect to Nk, {ez,,,Y,,)^'' , consists 
of the following rules 

possible{a, k) ^ (if executable(a, {/i, . . . , Im}) G D 
and holds{li, fc) € Yfe) 
occ{ak,k) ^ possible{ak,k). 
nocc{a,k) <— occ{b,k). 

(for every pair of actions a ^ b) 

Let Rk be the set of atoms occurring in the first rule of {ez^,Yk)^'°- Because 
soflo . . . a„s„ is a trajectory in D, ak is executable in ,Sfe. Thus, possible{ak,k) 
belongs to Rk- It is easy to see that Nk is the unique answer set of {eZk,Yk)^^ ■ In 
other words, Nk is an answer set of ez,,,Yu- The inductive step is proved. 

The property of implies that the sequence (Xt)JLo ^ solution to tti with 
respect to the sequence {UiYI^q. the splitting sequence theorem, X = [J^^q X^ 
is an answer set of tti. Because of the construction of Xt, we have that St{X) = 
St{Xt) = St for every t and occ{at,t) G X for every t, < t < n. The lemma is 
proved. □ 

The above lemmas lead to the following corollaries. 

Corollary .5. Let X be an answer set ofn. Then, 

(i) St{X) is a state of D for every t, < t < n, 

(a) if X contains occ{a, t) then a is executable in St{X) and st+i{X) e $(a, St{X)) 

for every t, < t < n — 1, and 
(Hi) if occ{a,t) ^ X for every action a, then St+i{X) = St{X) for every t, <t < 

n-1. 

Proof. It follows from the splitting theorem that Y — X O lit{ni) is an answer 
set of TTi. Because St{X) = st{Y) and Lemma .3, we conclude that X satisfies the 
(i)-(iii). □ 

Corollary .6. For every trajectory soao ■ ■ ■ a„_is„ in D and a consistent ac- 
tion theory {D, T), w has an answer set X such that 

(i) St{X) = St for every t, < t < n, and 

(ii) occ{at,t) S X for every t, < t < n — 1. 

Proof. From Lemma .4, there exists an answer set Y of tti such that St{Y) = St 
and occ{at,t) G Y. Again, from the splitting theorem, we can conclude that there 
exists an answer set X of tt such that Y = X n lit{'Ki). Because st{X) — st(Y), we 
conclude that X satisfies (i)-(ii). □ 
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The next observation is also useful. 

Observation .7. For every answer set X of tt, if there exists an t such that X 
does not contain an atom of the form occ{a,t), then X does not contain an atom 
of the form occ{a, t') for t <t' . 

Using the result of the above corollaries we can prove Theorem 3.2. 
Theorem .1. For a planning problem (D,r, A), 

(i) if Soao . . . a„_is„ is a trajectory achieving A, then there exists an answer set 

M of n„ such that 

(1) occ{ai, z) e M for i G {0, . . . , n — 1} and 

(2) Si = Si{M)ioTiG{0,...,n}. 
and 

(ii) if M is an answer set of n„, then there exists an integer < fc < n such 

that so{M)ao ■ ■ ■ ak-iSk{M) is a trajectory achieving A where occ{ai, i) G M for 
<i < k and if A: < n then no action is executable in the state Sfe(M). 

Proof. We have that n„ = tt U {(20), ^ not goal}. Assume that A = pi A . . . A 
Pk- 

(i) . Since sqOo . . . a„_is„ is a trajectory achieving A, the existence of X that 

satisfies the condition (i) of the theorem follows from Corollary .6. Furthermore, 
because of s„ |= A, we can conclude that holds{pi,n) G X for every i, 1 < i < k. 
Thus, X U {goal} is an answer set of n„. This implies the existence of M satisfying 

(ii) . Let M be an answer set of n„. It is easy to see that this happens iff goal G M 
and X = M\{goal} is an answer set of tt and holds{pi, n) € X for every i, 1 < i < k. 
It follows from Observation .7 that there exists an integer k <n such that for each 
i, < i < k, there exists an action such that occ{ai,i) G M and for t > k, 
occ{a,t) ^ M for every action a. By Corollary .5, we know that ai is executable 
in Si{M) and Sj+i(M) G $(ai,Sj(M)). This means that so{M)ao . . .ak-iSk{M) is 
a trajectory and Sk{M) = Sn{M). Moreover, A holds in s„(M) = Sk{M). Thus, 
so(M)ao . . .ak-iSk{M) is a trajectory achieving A. Furthermore, it follows from 
Corollary .5 and the rules (65) and (66) that if fc < n then M does not contain 
literals of the form possiWe(a, k). This implies that no action is executable in Sk{M) 
if k < n. □ 

Appendix A. 2 - Proofs of Theorem 4.9 

Theorem .2. Let 5 be a finite set of goal- independent temporal formulae, / = 
(so, si . . . Sn) be a sequence of states, and 

'^formulaiS, I) = formula U r{I) U r{S) 

where 

— r{S) is the set of atoms used in encoding S, and 

— r{I) = L}"^Q{holds{l,t) | Hs a fluent literal and I G St}. 

Then, 
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(i) The program liformuiaiS, I) has a unique answer set, X. 

(ii) For every temporal formula ^ such that formula{n^) G r{S), (j) is true in /j, 
i.e., It \= (j>, if and only if hf{n^,t) belongs to X where It = {st, ■ ■ ■ Sn)- 

Proof. First, we prove (i). We know that if a program is locally stratified then 
it has a unique answer set. We will show that ^formuia{,S, I) (more precisely, the set 
of ground rules oili formuia{S^ I)) is indeed locally stratified. To accomplish that we 
need to find a mapping A from literals of 11 formula {S, I) to N that has the property: 
if ^ Ai,A2,... A„, not Bi,not i?2, • • • not Bm is a rule in Hformuia{S, I), then 
A(Ao) > A(A,) for alll < i < n and X{Aa) > X{Bj) for all 1 < j < m. 

To define A, we first associate to each constant (f> that occurs as the first param- 
eter of the predicate formula{.) in Ilformuia{S,I) a non-negative number (7(0) as 
follows. 

— a{l) = if Z is a literal (recall that if Hs a literal then m = I). 

— a{n^) = <T{n^^) -j- 1 if ^ has the form negation(<;;!)i), next(0i), eventually(<;ii), 
or always 

— a{n^) — max{(7(n^J, cr(n^2)} + 1 if bas the form and((/>i, (/>2), or((/>i, (/>2), or 
until(0i,02)- 

We define A as follows. 

— X{hf{n^, t)) = 5 * a{n^) + 2, 

— \{hf -during{n^, t, t')) = 5 * cr(n^) -|- 4, and 

— X{1) = for every other literal of ^formula{S, I)- 

Examining all the rules in Ilformula{S, I), we can verify that A has the necessary 
property. 

Wc now prove (ii). Let X be the answer set oiIlformuia{S, I). We prove the second 
conclusion of the lemma by induction over a{n^). 

Base: Let (/> be a formula with a{n^) = 0. By the definition of cr, we know that 
is a literal. Then </> is true in St iff (j) is in St, that is, iff holds{^,t) belongs to X, 
which, because of rule (22), proves the base case. 

Step: Assume that for all < j < k and formula (j) such that a{n^) = j, the 
formula (j) is true in St iff hf{n^, t) is in X . 

Let (j) be such a formula that a{n^) = k + 1. Because of the definition of cr, ^ is a 
non-atomic formula. We have the following cases: 

— Case 1: (f) = negation((/)i). We have that (j{n^^) = cr(n^) — 1 = k. Because of 

formula{n^) G X and negation{n^^n^-^) G X, hf{n^,t) G X iif the body of rule 
(27) is satisfied by X iff hf{n^^,t) ^ X iS st ^ (pi (by inductive hypothesis) iff 
St \= (p- 

— Case 2: (j) = and((/)i, (f)2)- Similar to the first case, it follows from the rule (23) 
and the facts formula(n^) and and{n^, n^^ , n^^) that hf{n^, t) & X iif the body 
of rule (23) is satisfied by X iff hf{n^^,t) G X and hf{n^^,t) & X ]£ st \^ 4>i 
and St \= (j}2 (inductive hypothesis) iff St \= (j). 
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— Case 3: (j) = or((/)i, (/)2)- The proof is similar to the above cases, relying on the 
two rules (25), (26), and the fact formula{n^) G X and or{n^,n^^,n^^) G X. 

— Case 4: (/) = until (0i, (/)2). Wc have that ^(n^J < k and <j{n^^) < k. Assume 
that It \= (j). By Definition 4.7, there exists t <t2 < n such that It^ \= ^2 and 
for all t <t\ < t^, Iti \= <pi- By inductive hypothesis, hf{n^2i^'i) ^ ^ and for 
all ii, t < ti < t2, hf{n^-^,ti) £ X. It follows that hf -during {n^^,t,t2) G X. 
Because of rule (28), we have hf{n^,t) € X. 

On the other hand, if hf{n^, t) G X, because the only rule supporting hf{n^, t) is 
(28), there exists t < t2 < n such that hf -during {n^^,t,t2) G X and hf(n^^,t2)- 
It follows from hf -during {n^i,t, 12) G X that hf{n^^,ti) G X for all t < ti < t2- 
By inductive hypothesis, we have It^ \= (pi for all t <t\ <t2 and It^ \= (f)2 ■ Thus 
/( 1= until((/)i,02), i-c, It h 

— Case 5: (/) = next((/)i). Note that ^(n^^) < fc. Rule (31) is the only rule 
supporting /i/(n0, f) where (/) = next((^i). So hf{n^,t) £ X ifS hf{n^^,t+l) € X 
iff It+i H 01 iff H next(<^i) iff It \= cj>. 

— Case 6: (/i = aWays((/ii). We note that <T{n^-^) < k. Observe that hf{n^,t) is 
supported only by rule (29). So hf{n^,t) G X iff hf -during {n^-^,t,n) G X. The 
latter happens iff hf{n^^,t\) G X for all t < ti < n, that is, iff 7ti |= (pi for all 
t < ti < n which is equivalent to It \= always((^i), i.e., iff It ^ (f>- 

— Case 7: (j) = eventually(0i). We know that hf{n^, t) & X \s supported only by 
rule (30). So hf{n^,t) G X iff there exists t <t\<n such that hf{n^^,t-i) G X. 
Because (T{n^-^) < k, by induction, hf{n^,t) G X iff there exists t <ti <n such 
that ^ 01, that is, iff It \= eventually((/)i), i.e., iff It ^ (j). 

The above cases prove the inductive step, and hence, the theorem. □ 
Appendix A. 3 - Proof of Theorem 4.18 

We first prove some lemmas that are needed for proving Theorem 4.18. Abusing the 
notation, by tt/ we denote the program consisting of the rules of tt (Appendix A.l) 
and the set of rules H formula where the time constant T takes the value between 
and n. 

Lemma .3. For a consistent action theory {D,T), a ground complex action p, 
and an answer set M of 11^°^°^ with occ{ai,i) G M for i G {0, . . . ,n — 1}, 
so{M)aoSi{M) . . .a„_is„(M) is a trace of p. 

Proof. It is easy to sec that the union of the set of literals of tt/ and the 
set of rules and atoms encoding p, i.e., U = lit{-Kf) U r{p), is a splitting set of 
jlGolog Furthermore, &(7(n^'''°s) = tt/ U r{p). Thus, by the splitting theorem, M 
is an answer set of 11^°'°^ iff M = X U F where X is an answer set of tt/ U r{p), 
and Y is an answer set of 6(7(11^°'°^ \ tt/, AT). Because of the constraint _L <— 
not trans{np, 0, n), we know that if M is an answer set of 11^°'°^' then every answer 
set Y of eij {n^°^°s \Tr f , X) must contain trans{np, 0, n). Furthermore, we have that 
St{X) = St{M) for every t. Hence, in what follows we will use St{X) and St{M) 
interchangeably. We prove the conclusion of the lemma by proving a stronger 
conclusion^^: 



^''Recall that for simplicity, in encoding programs or formulae we use i or a as the name associated 
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(*) for every ground complex action q with the name Uq and two time points ti,t2 
such that g 7^ null and trans (rig, fi, G M, st^(M)atiSti+i(Af) • • •at2-iSt2(-^) 
is a trace of q (the states Si{M) and actions Oj are given in the Lemma's state- 
ment). 

Denote tts — e[/(n^°'°^ \7r/,X). We have that tts consists of the foUowing rules: 



trans{a, t,t + I) 


^ (if action(a) e X, occ{a, t) G X) 


(86) 


trans{f,ti,ti) 


^ (if ,form,ula{f, .) e X, hf{f,h) G X) 


(87) 


trans{p, ti, 


^ tl<t' < t2,trans(j)i,ti,t'),trans{p2,t' ,12)- 


(88) 




(if sequence{p,pi,p2) G X)) 




trans{n,ti,t2) 


<— trans{pi,ti,t2)- 


(89) 




(if choice Action{n) G X, m(pi,n) G X) 




trans(i, ti, 12) 


<— trans{pi,ti,t2)- 


(90) 




(ifi/(i,/,Pi,P2) gX, hf{f,h)ex) 




trans{i, ti, t2) 


<— trans{p2,ti,t2)- 


(91) 




{iiif{i,f,PuP2)GX, hfif,h)^X) 




trans{w,t\,t2) 


<— ti <t' < t2,trans{p,ti,t'),trans{w,t' ,t2)■ 


(92) 




iiiwhileiw,f,p)eX, hf{f,h)ex) 




trans{w, t, t) 


^ {iiwhMwj,p)ex,hf{f,t)^x) 


(93) 


trans{s, ti,t2) 


<— trans{p,ti,t2)- 


(94) 




(if choice Args{s,p) G X) 




trans{null, t, t) 




(95) 



Clearly, tts is a positive program. Thus, the unique answer set of 773, denoted by 
F, is the fix-point of the Tt^^ operator, defined by {X) = {o \ there exists a rule 
o <— Oi, . . . , o„ in 7r3 such that Oj G X for i = 1, . . . , n}. Let Yfe = T^^(0). By 
definition Y = lim„^oo Yn. 

For every atom o G F, let p{o) denote the smallest integer k such that for all 
< t < k, o ^ Yt and for all t > k, o £ Yf. (Notice that the existence of p(o) is 
guaranteed because is a monotonic, fix-point operator.) 

We prove (*) by induction over p{trans{nq,ti,t2))- 

Base: p{trans{nq,ti,t2)) = 0. Then tts contains a rule of the form 
trans{nq,t\,t2) <— • Because q ^ null, we know that trans(nq,t\,t2) <— comes 
from a rule r of the form (86), (87), or (93). 

— r is of the form (86). So, q is some action a, i.e., action{a) and occ{a,t) both 
belong to X. Further, t2 = ti + 1. Because of Corollary .5 we know that a is 

executable in st-^{X) and st2{X) G <l>(a, (X)). Since St{M) = St(X) for every 
we have that St^ (M) a (M) is a trace of q. 



to i or a, respectively. 
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— r is of the form (87). Then q = 4>,t2 = ti = t, where </> is a formula and hf{n^, t) 

is in X. By Theorem 4.9, (f) holds in St{X). Again, because st{M) = st{X), we 

have that St{M) is a trace of q. 
— r is of the form (93). Then, ti = t2, while{nq,(f),pi) £ X, and hf{n^,ti) ^ X. 

That is, q is the program "while (j) do pi" and (j) does not holds in stj(M). 

Thus, Sti (M) is a trace of q. 

Step: Assume that we have proved (*) for p{trans{nq,ti,t2)) < k. We need to 

prove it for the case p{trans{ng,ti,t2}) = k + 1. 

Because trans{nq,t\,t2) is in 1^3(1^), there is some rule trans{nq,t\,t2) <— 
^1, . . . Am in TTs such that all Ai, . . . are in Yf.. From the construction of tts, 
we have the following cases: 

— r is a rule of the form (88). Then, there exists qi,q2,t' such that 

sequence(nq,nq^,nqr^) G X, trans(nq^,ti,t') € Yk, and trans{nq^,t' ,12). 
Hence, p{trans{nq^,ti,t')) < k and p{trans{nq^,t' ,12)) < k. By in- 
ductive hypothesis, .s/.^ (M)otiSti+i(M) . . . at/_iSt'(M) is a trace of 
qi and st'{M)at'St'+i{M) . . . at2-iSi.2iM) is a trace of q2. Since 
sequence{nq,nqi,nq2) € X we know that q = qi;q2- By Definition 4.16, 
sti{M)atiSti+i{M) . . . at2-iSt2(M) is a trace of q. 

— r is a rule of the form (89). Then, choice Action{nq) is in X. So, q 
is a choice program, say q = qi \ q2... \ qi- In addition, there ex- 
ists I < j < I such that in{nq-,nq) € X and trans{nq. ,ti,t2) € Yfe. 
By the definition of p, p(trans{qj ,ti,t2)) < k. By inductive hypothesis, 
Sti {M)atiSti+i{M) . . . at2-iSt2{M) is a trace of qj. By Definition 4.16, it is also 
a trace of q. 

— r is a rule of the form (90). Then, by the construction of tts, there exists (j), qi, 
q2 such that if{nq, n^, Uq^ , Uq^) <E X, hf{ncf,, ti) G X, and trans{nq^ , ti,t2) G Yfc. 
Thus q is the program "if (p then qi else (72" and p{trans{nq-^,ti,t2)) < 
k. Again, by inductive hypothesis, St^{M)atiSi^^i{M) . . . at^-iSi^iM) is 
a trace of qi. Because of Theorem 4.9, 4) holds in s-t^{M). Hence, 
Stj^{M)atj^Stj^+i{M) . . . at^-iSt^iM) is a trace of q. 

— r is a rule of the form (91). Similarly to the previous items, we 

know that there exist (/), q, qi, and q2 such that ifiriq^n^^nq-^^nq^) G 
X, hf{n^,ti) ^ X, and trans{nq2,ti,t2) G Yk. This means that 
p{trans{nq2,t\,t2)) < k. Hence, by inductive hypothesis and Theorem 
4.9, (M)atjSfj+i(M) . . . ai2_i.Sf2 (M) is a trace of q2 and (j) is false in 
Sti{M), which mean that Sti{M)atiSti+i{M) . . .at2-iSt2{M) is a trace of 
"if (/) then qi else 92") i-e., a trace of q. 

— r is a rule of the form (92). This implies that there exist a formula </>, 
a program qi and a time point t' > ti such that while{nq,n^,nq^) G 
X and hf{n^,ti) G X, trans{nq^,ti,t') and trans{nq,t' ,12) are in Yk. 
It follows that q is the program "while (j) do qi\ Furthermore, (j) 
holds in .Sij(M), and (M)af jSt^+i(M) . . . a(/_iSt/ (M) is a trace of qi and 
St'(M)at/St'+i(M) . . . at2-iSt2{M) is a trace of q. By Definition 4.16, this im- 
plies that Sti{M)at^Sti+i{M) . . . at2-iSt2{M) is a trace of q. 
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— r a rule of the form is (94). Then, q has the form pick(X, {ci, . . . , c„}, gi). 
Therefore, choiceArgs{nq, {cj)) is in X for j = 1, ... , n. trans{nq, ti,t2) € Y 
impHes that there exists an integer j, 1 < J < n, such trans{ng^(^Cj),tijt2) G Yk- 
By the definition of p, p{trans{nq^{cj),ti,t2)) < k. By inductive hypothesis, 
sti{M)atiSti+i{M) . . .at2-iSt2iM) is a trace of program qi{cj). Thus, we can 
conclude that sti{M)atiSti+i{M) . . . at2-iSt2iM) is a trace of q. 

The above cases prove the inductive step for (*). The lemma follows immediately 

since trans{np,0,n) belongs to M. □ 

To prove the reverse of Lemma .3, we define a function p that maps each ground 
complex action q into an integer p{q) that reflects the notion of complexity of q 
(or the number of nested constructs in q). p{q) is defined recursively over the 

construction of q as follows. 

— For q = 4> and </) is a formula, or g = a and a is an action, p{q) = 0. 
— For q = qi;q2 ov q = if cf) then qi else 52, l^iq) = 1 + p{q\) + Kq^)- 
—For q = qi \ . . . \ qm, IJ-il) = 1 + max{/i(gi) | i = 1, . . . , m}. 

— For q = while (j) do gi, p{q) = 1 + p{qi)- 

—For q = pick(X, {ci, . . . ,c„},gi), p{q) = 1 + max{^(gi(cj)) \j = l,...,n}. 
— For q — p(ci,...,c„) where {p{Xi, . . . , Xn) : 61) is a procedure, p{q) = 1 + 
p{6i{ci, . . . ,c„)). 

It is worth noting that p{q) is always defined for programs considered in this paper. 

Lemma .4. Let {D,T) he a consistent action theory, p he a program, and 
soao . . . s„_ia„ he a trace of p. Then n^°'°^ has an answer set M such that 

— occ{ai,t) € M for < i < n ~ 1, 
— St = S((M) for every <t < n, and 
— trans{np,0,n) G M. 

Proof. We prove the lemma by constructing an answer set of n^°'°s that sat- 
isfies the conditions of the lemma. Similar to the proof of Lemma .3, we split 
YlGolog ygjjjg u ^ lit{'Kf) U r{p). This implies that M is an answer set of n^°'°s 
iff M = X U F where X is an answer set of 6(7(11^°'°^) and Y is an answer set 
of TTs = e(7(n^°'°9 \ 6(7(11^°'°^), X), which is the program consisting of the rules 
(86)-(95) with the corresponding conditions. 

Because sqUo ■ ■ .an-iSn is a trace of p, it is a trajectory in D. By Corollary .6, we 
know that tt/ has an answer set X' that satisfies the two conditions: 

— occ{ai, t) € X' for < i < n — 1 and 
— St = St{X') for every <t < n. 

Because r{p) consists of only rules and atoms encoding the program p, it is easy to 
see that there exists an answer set X of tt/ U r{p) such that X' C X. Clearly, X 
also satisfies the two conditions: 

— occ{ai, G X for < i < n — 1 and 
— St = St{X) for every <t <n. 
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Since tts is a positive program we know that 773 has a unique answer set, say Y. 
Prom the sphtting theorem, we have that M = X UY is an answer set of n^°'°f . 
Because st{X) = st{M), M satisfies the first two conditions of the lemma. It 
remains to be shown that M also satisfies the third condition of the lemma. We 
prove this by proving a stronger conclusion: 

(*) If is a program with the name Uq and there exists two integers ti and ^2 such 

that ,Sij(Af)aj^ . . . a/,,_i.st, (Af) is a trace of q then trans{nq,ti,t2) G M. (the 
states Si{M) = Si - see above - and the actions are defined as in the Lemma's 
statement) 

We prove (*) by induction over iJ.{q), the complexity of the program q. 
Base: ^{q) = 0. There are only two cases: 

— q = (j) for some formula ^, and hence, by Definition 4.16, we have that t2 = ti. It 
follows from the assumption that St^ (M) is a trace of q that (M) satisfies (j). 
By Theorem 4.9. hf{ncj,,ti) G X, and hence, we have that trans{n^,t\,t\) € Y 
(because of rule (87)). 

— q = a where a is an action. Again, by Definition 4.16, we have that f2 = ti + 1- 

From the assumption that Sti{M)at-^st2{M) is a trace of q we have that = a. 
Thus, occ{a,ti) G M. By rule (86) of tts, we conclude that trans{a,ti,t2) G Y, 
and thus, trans{a,ti,t2) € M. 

The above two cases prove the base case. 

Step: Assume that we have proved (*) for every program q with fi{q) < k. We 
need to prove it for the case fi{q) = k + 1. Because fi{q) > 0, we have the following 

cases: 

— Q = 92- By Definition 4.16, there exists t' , t\ < t' <t2, such that St^ati ■ ■ - Sf 
is a trace of gi and St'^t' ■■■St2 is a trace of 52- Because n{qi) < n{q) and 
^(52) < fJ-{q), by inductive hypothesis, wc have that trans{nq-^,ti,t') £ M and 
trans{nq2,t' ,12) € M. q = qi;q2 implies sequence{nq,nq^,nq2) G M. By rule 
(88), trans{nq,t\,t2) must be in M. 
— q = qi \ . . . \ q-i. Again, by Definition 4.16, s/^o/ j . . . at2-iSt2 is a trace of some qj. 
Since ii{qj) < iJ-{q)^ by inductive hypothesis, we have that trans{nq.,ti,t2) G M. 
Because of rule (89), trans{nq,ti,t2) is in M. 
— q = if (j) then qi else 92- Consider two cases: 

— (j) holds in st-^ . This implies that St^ {M)ai^ . . . af^-istg (M) is a trace of qi. Be- 
cause of Theorem 4.9, hf{n^,ti) G M. Since /u((/i) < /u((/), trans{nq^,ti,t2) G 
M by inductive hypothesis. Thus, according to rule (90), trans{nq,t\,t2) must 
belong to M . 

— 4> does not holds in st^ . This implies that St^ {M)ati ■ ■ ■ «t2-iSt2 (-^) is a trace of 
q2. Because of Theorem 4.9, hf{n^, ti) does not hold in M. Since /u(gi) < /u(g), 
trans{nq2 , ^1,^2) is in M by inductive hypothesis. Thus, according to rule (91), 
trans{nq,ti,t2) G M. 
— q = while </> do qi. We prove this case by induction over the length of the 

trace, t2 —h. 
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— Base: t2 — t\ = 0. This liappens only when does not hold in stj(M). As 

such, because of rule (93), trans{nq,t\,t2) is in M. The base case is proved. 
— Step: Assume that wc have proved the conclusion for this case for < t2~ii < 
I. We will show that it is also correct for t2 — t\ = I. Since t2 — t\ > 0, we 
conclude that (f) holds in st^ and there exists h <t' < t2 such that st^ati . ■ .Sf 
is a trace of qi and St'at' ■ . ■St2 is a trace of q. We have /i(gi) < /i(9); t' — ti < 
t2 — ti and t2 — t' < t2 — ti = I. By inductive hypothesis, trans{nq^ , ti,t') and 
trans{nq,t' ,t2) are in M. By Theorem 4.9, hf{n^,t\) is in M and from the 
rule (92), trans{nq,ti,t2) is in M. 
— q = pick(X, {ci, . . . , c„}, gi). So, there exists an integer j, I < j < n, 
such that the trace of 5 is a trace of qi{cj). Since iJ,{qi{cj)) < ^(g), 
we have that trans{nq^(^^.-^,ti,t2) G M. This, together with the fact that 
choiceArgs{nq,nq^(^c.)) G r(p), and the rule (94) imply that trans(ng,ti,t2) is in 



— q = p{ci, . . . ,Cn) for some procedure (p(Xi, . . . , Ar„), gi). This implies that 
Sti(M)ati . . . at2-iSt2{M) is a trace of gi(ci, . . . , c„). Since /x(gi(ci, . . . , c„)) < 
/i(g), we have that trans{nq^ , t\,t2) G M. Since rig^^ = p{c\, . . . , c„) and Uq = q, 
we have that trans{nq,ti,t2) G M. This proves the inductive step for this case 
as well. 

The above cases prove the inductive step of (*). The conclusion of the lemma 
follows. □ 

We now prove the Theorem 4.18. 

Theorem .6. Let {D,r) be a consistent action theory and p be a program. 
Then, 

(i) for every answer set M of n^°'°f with occ{ai,i) G M for i G {0, . . . , n — 1}, 
so(M)ao . . . a„_is„(M) is a trace of p; and 

(ii) if So«o • • • c-n-iSn is a trace of p then there exists an answer set M of 11^°^°^ 
such that Sj = Sj{M) and occ{ai, i) G M for j G {0, . . . , n} and i G {0, . . . , n— 1}. 

Proof, (i) follows from Lemma .3 and (ii) follows from Lemma .4. □ 
Appendix A. 4 - Proof of Theorem 4.22 

Let p now be a general program. To prove Theorem 4.22, we will extend the 
Lemmas .3-. 4 to account for general programs. Similarly to the proofs of Lemmas 
.3-. 4, we will split H^^'^ by the set U = lit{-Kf) U r{p). Thus M is an answer set 
of n^-^^ iff M = A U y where X is an answer set of tt/ U r(p) and Y is an answer 
set of the program eu (n^^^ \ 6c/ (n^^^), X), denoted by 774, which consists of the 
rules of program tt^ (with the difference that a program is now a general program) 
and the following rules: 



M. 



l{begin{n,i,t:i,ti, 




(96) 
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between{t3,ti,t2)}l ^ trans{n,ti,t2)- (97) 
(if htn{n, s, c) G X, in{i, s) S X) 

I{end{n,i,t3,ti,t2) : 

bet'ween{ts,ti,t2)}l <— trans{n,t\,t2)- (98) 

(if htn{n, s, c) G X, in{i, s) e X) 
used{n,t,ti,t2) •*— begin{n,i,b,ti,t2), (99) 
end{n, i, e, ti,t2), 
b<t<e. 

(if htn{n, s, c) e X, in{i, s) e X) 
notjased{n,t,t\,t2) <— not used{n,t,ti,t2)- (100) 
overlap{n,t,ti,t2) <— begin{n,ii,bi,ti,t2), (101) 
end{n, ii,ei,ti,t2), 
begin{n,i2,b2,ti,t2), 
end{n, Z2, 62, ti, t2), 
61 < f < ei, 62 < t < 62- 

(if htn{n,s,c) G X, in{ii,s) G X, in{i2,s) € X) 
nok{n,ti,t2) <— h > t4„begin{n,i,t3,ti,t2), (102) 

end{n, i, t^, ti, 12). 

(if htn{n, s, c) G X, m(i, s) G X) 
nok{n,ti,t2) <— ta < t4,begin{n,i,t3,ti,t2), (103) 

enrf(n, i,t4,fi,t2), 

not trans{i, t^, ti). 

(if htn{n, s, c) G X, ^(i, s) G X) 
nok{n,ti,t2) <— ti < t < t2,not_used{n,t,ti,t2)- (104) 

(if htn{n, s, c) G X) 
nok{n,ti,t2) <— ti < t < t2,overlap{n,t,ti,t2)- (105) 

(if htn{n, s, c) G X) 
nok{n,ti,t2) <— begin{n,ii,bi,ti,t2), (106) 

6e5m(n, 12, 62, ii,i2), 

61 > 62- 

(if htn{n, s,c) G X, in(ii,s) G X, in{i2,s) G X, 
m(o, c) G X, order {o,ii, 12) G X)) 
nok{n,ti,t2) <— end{n,ii,ei,ti,t2), (107) 

begin{n,i2,b2,ti,t2),ei < tz < 62- 
(if htn{n, s, c) G X, in{ii, s) G X, in(?'2, s) G X, 
m(o, c) G X, ■maintain{o, f,ii,i2) G X, 
and/i/(/,t3) ^X) 



ACM Transactions on Computational Logic, Vol. V, No. N, February 2008. 



Domain-Dependent Knowledge in Answer Set Planning 



App-21 



nok{n,ti,t2) <— begin(n,i,b,ti,t2),end{n,i,e,ti,t2), 
(if htn{n, s, c) € X, in{i, s) € X, 
in{o, c) G X,precondition{o, f,i)&X 
and hf if ,b) ^ X) 

nok{n,ti,t2) <— begin{n,i,b,ti,t2),end{n,i,e,ti,t2)- 
(if htn{n, s, c) G X, m(z, s) e X, 
m(o, c) e X, postcondition{o, f, i) € X, 
and hf if, e)^X) 

We will continue to use the complexity of program defined in the last appendix and 
extend it to allow the HTN-construct by adding the following to the definition of 

-For q = {S, C), ti{q) = 1 + ^p^st^(p)- 

Notice that every literal of the program 714 has the first parameter as a program^®. 
Hence, we can associate iJb{q) to each literal I of 774 where q is the first parameter 

of /. For instance. fj,{trans{q,ti,t2)) = fJ.{q) or ji{nok{q,t 1,12)) = fiil) etc.. Since 
we will continue to use splitting theorem in our proofs, the following observation is 
useful. 

Observation .7. The two cardinality constraint rules (97)-(98) can he replaced 
by the following normal logic program rules: 



(108) 



(109) 



begin{n,i,t,ti,t2) trans(ji,ti,t2), 

ti < t < t3 < t2, not nbegin{n, i, t, ti, <2)- 
nbegin{n,i,t,ti,t2) <— trans{n,ti,t2), 

ti <t<t2,ti <ts<t2,t^t3, begin{n, i, is, ti, t2)- 
end{n,i,t,ti,t2) ^ trans{n,ti,t2), 

ti < t < ti < t2, not nend{n, i, t, ii, ^2)- 
nend(n,i,t,ti,t2) ^ trans{n,ti,t2), 

ti <t < t2,ti <ts<t2,t^ t3,end{n,i,t3,ti,t2)- 

That is, let w* be the program obtained from -k^ by replacing the rules (97)- 

(98) with ike above set of rules. Then, M is an answer set of ir^ iff M' = 
M U {nbegin{n,i,t3,ti,t2) | begin{n,i,t,ti,t2) € M,t ^ i3,ii < t,t3 < ^2} U 
{nend{n,i,t3,ti,t2) \ t ^ ts, ti < t,ts < t2,end{n,i,t,ti,t2) € M} is an answer 

set of TT* . 

The next lemma generalizes Lemma .3. 

Lemma .8. Let q be a general program, Y be an answer set of the program, 
eu(n.!i^^^ \ bu(n.n^^), X) (i.e. program ni), and ti,t2 be two time points such 
that q 7^ null and trans{nq,ti,t2) G Y. Then, st^{M)at^st^+i{M) . . . at2-iSt2{M) 
is a trace of q where M = X UY . 



i^More precisely, a program name. 
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Proof. Let Uk ^ {I \ I e lit{TTi). fi{l) < k}. 

From observation .7, we know that wc can use the splitting theorem on 1^4. It is 
easy to see that {Uk)k<oo is a splitting sequence of 774. From the finiteness of 774 
and the sphtting sequence theorem, we have that Y = Ui<(x) where 

(1) lo is an answer set of the program bu^^ {774) and 

(2) for every integer i, l^+i is an answer set for {bui+i {t^a) \ but (^4)) Uj<i ^O- 
We prove the lemma by induction over iJ,{q). 

Base: ^{q) = 0. Prom trans{nq,t\,t2) € Y, we have that trans{nq,ti,t2) € Vo- It 
is easy to sec that bifg{TT4) consists of all the rules of tt4 whose program has level 0. 
It follows from Lemma .3 Sti{M)atiSti+i{M) . . . at2-iSt2{M) is a trace of g. The 
base case is proved. 

Step: Assume that we have proved the lemma for ^{q) = k. We prove it for 

li{q) = fc + 1. From the fact that trans{nq,ti,t2) G M and fJ.{nq) = fc + 1 we 
have that trans{nq,ti,t2) € Yk+i where Yfe+i is an answer set of the program 
^Uki^Uk+ii'^i) \ buk{''^i)i\Jj<k'^k) which consists of rules of the form (96)-(109) 
and (88)-(94) whose program has the level fc + 1, i.e., ii{q) = fc + 1. Because 
transiriq, ^1,^2) &Y we know that there exists a rule that supports transiriq, ti,t2)- 
Let r be such a rule. There are following cases: 

— r is a rule of the form (88)- (94), the argument is similar to the argument using 
in the inductive step for the corresponding case in Lemma .3. Notice a minor 
difference though: in Lemma .3, we do not need to use fi{q). 

— r is a rule of the form (96) , which implies that q — {S, C) where S is a set of 
programs and C is a set of constraints C . By definition of answer sets, we know 
that nok{nq,ti,t2) i^fe+i. Furthermore, because of the rules (97) and (98), the 
fact that trans{nq, ti, t2) G Yk+i and the definition of weight constraint rule, we 
conclude that for each qj G S there exists two numbers ji, and j'g, ti < jh,je < 
t2 such that begin{nq,nq.Jb,ti,t2) G Yfe+i and end{nq,nq^ , je,ti,t2) G Yfe+i. 
Because of rule (103), we conclude that trans{nq. ,jb,je) G Ui<fe^j- Otherwise, 
we have that nok{nq,ti,t2) G ^fc+i, and hence, trans{nq,ti,t2) ^ ^fe+i, which 
is a contradiction. By definition of /!((/), we have that /x(gj) < iJL(q). Thus, by 
inductive hypothesis, wc can conclude that: for every qj G 5*, there exists two 
numbers jb and je, ii < J&, je < ^2, Sj^{M)aj^ . . . aj^^iSj^{M) is a trace of qj. 

Furthermore, rules (99)-(105) imply that the set {jb \ qj € S} creates a permu- 
tation of {1, ... , 15*1} that satisfies the first condition of Definition 4.20. 

Consider now an ordering qj^ -< qj^ in C. This implies that the body of rule 
(106) will be satisfied li jbi > ib2 which would lead to trans{nq,ti,t2) ^ ifc+i. 
Again, this is a contradiction. Hence, we must have jb^ < jb^ that means that 
the permutation {jb \ qj G S} also satisfies the second condition of Definition 
4.20. 

Similarly, using (107)-(109) wc can prove that the permutation {jb \ qj G 5} also 
satisfies the third and fourth conditions of Definition 4.20. 

It follows from the above arguments that (M)ati • • • ota-iSta {M) is a trace of 
q). The inductive step is proved for this case. 
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The above cases prove the inductive step. This concludes the lemma. □ 

In the next lemma, we generalize the Lemma .4. 

Lemma .9. Let (-D,r) be a consistent action theory, p he a general program, and 
Socto • • -an-iSn he a trace of p. Then, there is an answer set M ofli^^^ such that 
Si{M) = Si and occ{ai,i) € M and trans{np,0,n) e M. 

Proof. Based on our discussion on splitting H^^^ using lit{TTf) U r{q) and the 
fact that sqUq . . . ttn-iSn is also a trace in f, we know that there exists an answer 
set X of TT U r{p) such that Si{X) = s, and occ{ai,i) £ X. Thus, it remains to 
be shown that there exists an answer set Y of 7r4 such that trans{np,0,n) G Y. 
Similar to the proof of Lemma .8, we use {Uk)k<oc as a splitting sequence of 774 
where Uk = {u \ u € lit{TT4), l-i{u) < k}. Prom the splitting sequence theorem, we 
have that Y = Ui<oo ^« where 

(1) Yq is an answer set of the program bu„{n4) and 

(2) for every integer i, Yi-^-i is an answer set for eUiii'Ui^ii'n'i) \ ^!7i (^4), Uj<i ^j)- 

We prove the lemma by induction over Similar to Lemma .4, we prove this 

by proving a stronger conclusion: 

(*) There exists an answer set Y = Ui<oo ""4 such that for every program q ^ 
null occurring in p, stittt^ . . . at^-ist^ is a trace of q iff trans{nq,ti,t2) € ^^(g)- 
(the states Si and the actions are defined as in the Lemma's statement) 

We will prove (*) by induction over iJ,{q). 

Base: = 0. Similar to the base case in Lemma .4 . 

Step: Assume that we have proved (*) for ^{q) < k. We need to prove (*) 

for ii{q) ~ k + 1. We will construct an answer set of 7r+ = e^/^ (6f/^^^ (7r4) \ 
buki'^i): Uj<fe ^k) such that for every program q occiuTing in p with fj,{q) = k + 1, 
if StiOtj . . .at2-iSt2 is a trace of g then trans{nq,ti,t2) S Ifc+i- 

Let Vfe+i be the set of atoms defined as follows. 

— Por every program q with ij,{q) = k + 1, if g' is not of the form {S,C) and 
Stittti ■ ■ .at2-iSt2 is a trace of q, Yfe+i contains trans{nq,ti,t2)- 

— Por every program q with n{q) = k + 1, q = {S,C), and St^ati ■ ■ .aj^-iStj is a 
trace of q. By definition, there exists a permutation {ji, . . . ,j\s\} of {1, ... , \S\} 
satisfying the conditions (a)-(d) of Item 8 (Definition 4.20). Consider such a 
permutation. To simplify the notation, let us denote the begin- and end-time of 
a program qj G 5 in the trace of q by bj and ej, respectively, i.e., 81,^0,1,^ ■ ■ ■ Sg. is 
a trace of qj. Then, Yk+i contains trans{nq,ti,t2) and the following atoms: 

(1) begin{nq, Uq. , bj,ti,t2) for every qj G S, 

(2) end{nq,nq^,ej,ti,t2) for every qj e S, and 

(3) used{nq, t, fi, ^2) for for every qj S S and bj < t < Cj . 

— Vfe+i does not contain any other atoms except those mentioned above. 

It is easy to see that Yfe+i satisfies (*) for every program q with ^{q) = k+1. Thus, 
we need to show that Yk+i is indeed an answer set of 7r+. Pirst, we prove that Yfe+i 
is closed under (7r+)^'=+^. We consider the following cases: 
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— r is a rule of the form (88). Obviously, if r belongs to (7r+)^'=+i, then q = qi;q2 
and there exists a. ti < t' < such that trans{nq^,t\,t') and trans{nq^,t' ,12) 
belong to ljj<fc ^fc because ^i{qi) < i^{q) and ^i{q2) < fJ-iq)- By inductive hy- 
pothesis, Strati . . .St' is a trace of qi and St'ttf ... 5*2 is a trace of q2- By Def- 
inition 4.16, Stittti ■ ■ - St^ is a trace of q. By construction of Ffc+i we have that 
trans{nq,ti,t2) € Yk+i- This shows that Yk+i is closed under r. Similar argu- 
ments conclude that Yfc+i is closed under the rule of the form (89)-(94). 

— r is a rule of the form (96) of (7r+)^''+i . Then, q = {S, C) and by construction of 
Yk+i; if SfjAti . . . is a trace of q then we have trans{nq,ti,t2) € Ifc+i. Thus. 
Yk+i is closed under the rules of the form (96) too. 

— r is a rule of the form (97) and (98). Y/j+i is also closed under r because whenever 
trans{nq,ti,t2) S Yk+i, we now that there is a trace st-^at-^ ■ ■ ■ st^ of q, and 
hence, by Definition 4.20, we conclude the existence of the begin- and end-time 
points bj and ej of qj, respectively. By construction of Yfe+i, we have that 
begin(nq,nq^,bj,ti,t2) and end{nq,nq^,ej,ti,t2) belong to Yfe+i and for each qj, 
there is a unique atom of this form in Yk+i. Hence, Yfe+i is closed under rules of 
the form (97) and (98). 

— r is a rule of the form (100)-(109). The construction of Ffe+i ensures that the 
body of r is not satisfied by Ife+i, and hence, Yfe+i is closed under r. 

— r is a rule of the form (99). Because used{nq,t,ti,t2) belongs to Yk+i for every 
t, ti < t < t2- we have that Yk+i is closed under r too. 

The conclusion that Y^+i is closed under (tt^)^*^*! follows from the above cases. 

To complete the proof, we need to show that Y^+i is minimal. Assume the contrary, 
there exists a proper subset Y' of Yfc+i such that Y' is closed under (7r"'")^'=+i . Let 
u e Y/s-i-i \ Y'. Since u £ Yfc+i, we have the following cases: 

— u is the head of a rule of the form (88)- (94). By definition of tt"*", we know that 
a rule of this form belongs to tt"*" iff its body is empty. Thus, from the closeness 
of Y' we have that u €Y'. This contradicts the fact that u ^Y'. 

— u is the head of a rule of the form (96) . Similar to the above case, we can conclude 
that u € Y' which again contradicts the fact that u ^Y'. 

— u is the head of a rule r of the form (97). Because of u G Yk+i we conclude that 
trans{nq,ti,t2) £ Yk+i. The above case concludes that trans{nq,ti,t2) G Y'. 
Since the body of r is true, we conclude that there exists some qj G S such that 
Y' does not contain an atom of the form begin{nq, Uq. , bj,t\,t2)- Thus, Y' is not 
closed under r. This contradicts the assumption that Y' is closed under (7r"'")^'=+i . 

— u is the head of a rule r of the form (98). Similar to the above case, we can prove 
that it violates the assumption that Y' is closed under (7r"'")^'=+i. 

— u is the head of a rule r of the form (99). Because u G Yk+\ we know that the 
body of r is satisfied by Yk+i, and hence, r belongs to (7r+)^'=+i. Again, because 
of the closeness of Y' , we conclude that u &Y' which violates the assumption 
that u ^ Y'. 

The above cases imply that Y' is not closed under (7r+)^'=+i. Thus, our assumption 
that Yk+i is not minimal is incorrect. Together with the closeness of Vfe+i, we have 

ACM Transactions on Computational Logic, Vol. V, No. N, February 2008. 



Domain-Dependent Knowledge in Answer Set Planning 



App-25 



that Yft+i is indeed an answer set of tt"*". The inductive step is proved since Yfe+i 
satisfies (*) for every program q with ii{q) = A; + 1. This proves the lemma. □ 

Theorem . 10. Let {D, T) be a consistent action theory and p be a general pro- 
gram. Then, 

(i) for every answer set M of 11^-^^ with occ{ai,i) G M for i G {0, . . . ,n — 1}, 
SQ{M)aQ . . . an-iSn{M) is a trace of p; and 

(ii) if sqUo ■ ■ ■ ttn-iSn is a trace of p then there exists an answer set M of n^"^^ 
such that Sj = Sj (M) and occ{ai,i) G M for j e {0, . . . , n} and i G {0, . . . , n— 1} 

and trans{np,0,n) G M. 

Proof, (i) follows from Lemma .8 and (ii) follows from Lemma .9. □ 
Appendix B - Splitting Theorem 

In this appendix, we review the basics of the Splitting Theorem [Lifschitz and 
Turner 1994]. Because programs in this paper do not contain classical negations, 
some of the definitions have been modified from the original presentation in [Lifs- 
chitz and Turner 1994]. 
Let r be a rule 

ao <— ai, . . . , am, not a,n+i, . . . , a„. 

By head{r), body{r), and lit(r) we denote ao, {ai,...,a„}, and {ao, ai, . . . , a„}, 
respectively. pos{r) and neg{r) denote the set {ai, . . . , am} and {am+i, ■ ■ ■ , dn}, 
respectively. 

For a program II over the language CV, a set of atoms of CP, A, is a splitting set 
of n if for every rule r G II, if head{r) S A then Ut{r) C A. 

Let A be a splitting set of II. The bottom o/II relative to A, denoted by 6a (H), is 
the program consisting of all rules r G II such that lit{r) C A. 

Given a splitting sot A for II, and a set X of atoms from lit{bA{^)), the partial eval- 
uation o/n by X with respect to A, denoted by 6^(11, X), is the program obtained 
from n as follows. For each rule r G II \ 6^(11) such that 

(1) pos{r)nAC X; 

(2) neg{r) n ^ is disjoint from X; 

we create a rule r' in 6^(11, X) such that 

(1) head{r') = head{r), and 

(2) pos{r') = pos{r) \ A, 

(3) neg{r') — neg{r) \ A. 

Let ^ be a splitting set of II. A solution to II with respect io ^ is a pair {X, Y) of 
sets of atoms satisfying the following two properties: 

(1) X is an answer set of 6^(11); and 

(2) Y is an answer set of 6^(11 \ bA^O), X); 

The splitting set theorem is as follows. 
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Theorem .11 Splitting Set Theorem, [Lifschitz and Turner 1994]. 
Let A be a splitting set for a program 11. A set S of atoms is a consistent answer 
set of U ijf S = X UY for some solution {X, Y) to H with respect to A. □ 

A sequence is a family whose index set is an initial segment of ordinals {a | a < 
/Lt}. A sequence (Aa)a<M of sets is monotone if Aa C Af3 whenever a < jS, and 
continuous if, for each limit ordinal a < fi, A^ = [j^^a A^. 

A splitting sequence for a program 11 is a nonempty, monotone, and continuous 
sequence of splitting sets of 11 such that Zif(n) = Ua</n^a- 

Let {Aa)a<fi be a splitting sequence of the program 11. A solution to H with respect 
to ^4 is a sequence (-Eq)q<;j of set of atoms satisfying the following conditions. 

(1) i^o is an answer set of the program 6Ao(n); 

(2) for any a such that a + 1 < /it, Ea+i is an answer set for eA„(&A<:,+i(n) \ 

&A„(n),u^<„i^7); and 

(3) For any limit ordinal a < /x, i?Q = 0. 

The splitting set theorem is generalized for splitting sequence next. 

Theorem .12 Splitting Sequence Theorem, [Lifschitz and Turner 1994]. 
Let A = {Ac,)a<p, be a splitting sequence of the program 11. A set of atoms E is an 
answer set ofU iff E = Uc«</i-^« Z^'' some solution {Ea)a<tJ. to 11 with respect to 
A. □ 

To apply Theorems .11-. 12 to programs with constraints of the form (6), we need 
to modify the notation of the bottom of a program relative to a set of atoms as 
follows. 

Let n = Hi U 112 be a program with constrains where IIi is a set of rules of the 
form (5) and 112 is a set of rules of the form (6). For a splitting set A of 11, we 
define 6A(n) = ^^(ni) U CA(Ji-2) where CA{Ii-2) = {r | r e n2, lit{r) C A}. 

We can prove that Theorems .11-. 12 hold for programs with constraints. For 
example, if A is a splitting of the program H, then S is an answer set of 11 iff A = 
XuY where X is an answer set of 6^(11) and Y is an answer set of eA(n\6A(n), X). 
The proof of the modified theorems is based on two observations: (i) a set A of 
atoms from litCn) is a splitting set of 11 iff it is a splitting set of IIi (because 
_L ^ /it(n)); and (ii) a set of atoms S is an answer set of 11 iff 5 is answer set of Hi 
and S satisfies the rules of 112. 
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